SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: MAERTENS , GEERT 
BOSMAN, FONS 
DE MARTYNOFF, GUY 
BUYSE, MARIE-ANGE 

(ii) TITLE OF INVENTION: PURIFIED HEPATITIS C VIRUS ENVELOPE 
PROTEINS FOR DIAGNOSTIC AND THERAPEUTIC USE 

(iii) NUMBER OF SEQUENCES: 111 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: NIXON & VANDERHYE P.C. 

(B) STREET: 1100 NORTH GLEBE ROAD 

(C) CITY: ARLINGTON 

(D) STATE: VIRGINIA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 22201-4714 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/612,973 

(B) FILING DATE: ll-MAR-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: BYRNE, THOMAS E. 

(B) REGISTRATION NUMBER: 32,205 

(C) REFERENCE/DOCKET NUMBER: 1487-10 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (703) 816-4000 

(B) TELEFAX: (703) 816-4100 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GGCATGCAAG CTTAATTAAT T 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

p^ZCGGGGAGGC CTGCACGTGA TCGAGGGCAG ACACCATCAC CACCATCACT AATAGTTAAT 60 

^TAACTGCA 68 

fU(2) INFORMATION FOR SEQ ID NO: 3: 

SJ (i) SEQUENCE CHARACTERISTICS: 

if, (A) LENGTH: 642 base pairs 

■ "J (B) TYPE: nucleic acid 

""' 4 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: cDNA 

flj (iii) HYPOTHETICAL: NO 

f{ (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..639 



(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1. .636 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA CTG TCC TGT 48 
Met Pro Glv Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 

CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG TCC GGG ATG 96 
Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG TAT GAG GCA 144 



HCVENV.-T 



Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 



GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG CTC GCA GCT 
Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC GTC GAT TTG 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCG GCT GCT CTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 
Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 HO 

TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC TCG CCT CGC 
Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He Ser Pro Arg 
115 120 125 

CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC 
Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

ATA ACA GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT 
He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
145 150 155 160 

ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC 
Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
165 170 175 

GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG GGC CTC GCC 
Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
180 185 190 

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA 
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
195 200 205 

CTC TTT GCT CTC TAATAG 
Leu Phe Ala Leu 
210 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 



m % 



Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Met 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Leu Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He Ser Pro Arg 
115 120 125 

Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Wile Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
03145 150 155 160 

mrhr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro Gin Ala Val 
~i 165 170 175 

P^al Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala 
>* 180 185 190 

Qryr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He Val Met Leu 
iQ 195 200 205 

kLeu Phe Ala Leu 
X t 210 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..792 

(ix) FEATURE: 



. (A) NAME /KEY: mat_peptide 
(B) LOCATION: 1..789 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATG 
Met 
1 


TTG GGT AAG GTC 
Leu Gly Lys Val 
5 


ATC GAT 
He Asp 


ACC 
Thr 


CTT 
Leu 


ACA 
Thr 
10 


TGC 
Cys 


'GGC 
Gly 


TTC 
Phe 


GCC 
Ala 


GAC 
Asp 
15 


CTC 
Leu 


48 


GTG 
Val 


GGG TAC 
Gly Tyr 


ATT 
He 
20 


CCG 
Pro 


CTC 
Leu 


GTC 
Val 


GGC GCC 
Gly Ala 
25 


CCC 
Pro 


CTA 
Leu 


GGG 
Gly 


GGC 
Gly 


GCT 
Ala 
30 


GCC 
Ala 


AGG 
Arg 


96 


GCC 
Ala 


CTG GCG CAT GGC 
Leu Ala His Gly 
35 


GTC CGG 
Val Arg 


GTT 
Val 
40 


CTG 
Leu 


GAG 
Glu 


GAC 
Asp 


GGC 
Gly 


GTG 
Val 
45 


AAC 
Asn 


TAT 
Tyr 


GCA 
Ala 


144 


ACA 
Thr 


GGG AAT TTG 
Gly Asn Leu 
50 


CCC 
Pro 


GGT TGC 
Gly Cys 
55 


TCT 
Ser 


TTC 
Phe 


TCT 
Ser 


ATC 
He 


TTC 
Phe 
60 


CTC 
Leu 


TTG 
Leu 


GCT 
Ala 


TTG 
Leu 


192 


CTG 

O 65 


TCC TGT 
Ser Cys 


CTG 
Leu 


ACC 
Thr 


GTT 
Val 
70 


CCA 
Pro 


GCT 
Ala 


TCC 
Ser 


GCT 
Ala 


TAT 
Tyr 
75 


GAA 
Glu 


GTG 
Val 


CGC 
Arg 


AAC 
Asn 


GTG 
Val 
80 


240 


ft5er 


GGG ATG 
Gly Met 


TAC 
Tyr 


CAT 
His 
85 


GTC 
Val 


ACG 
Thr 


AAC GAC TGC 
Asn Asp Cys 
90 


TCC 
Ser 


AAC 
Asn 


TCA 
Ser 


AGC 
Ser 


ATT 
He 
95 


GTG 
Val 


288 


^TAT 


GAG GCA 
Glu Ala 


GCG 
Ala 
100 


GAC 
Asp 


ATG 
Met 


ATC 
He 


ATG 
Met 


CAC 
His 
105 


ACC 
Thr 


CCC 
Pro 


GGG 
Gly 


TGC 
Cys 


GTG 
Val 
110 


CCC 
Pro 


TGC 
Cys 


336 


□GTT 


CGG GAG 
Arg Glu 
115 


AAC 
Asn 


AAC 
Asn 


TCT 
Ser 


TCC 
Ser 


CGC 
Arg 
120 


TGC 
Cys 


TGG 
Trp 


GTA 
Val 


GCG 
Ala 


CTC 
Leu 
125 


ACC 
Thr 


CCC 
Pro 


ACG 
Thr 


384 


■3CTC 
^Leu 


GCA GCT AGG AAC 
Ala Ala Arg Asn 
130 


GCC 
Ala 


AGC 
Ser 
135 


GTC 
Val 


CCC 
Pro 


ACC 
Thr 


ACG 
Thr 


ACA 
Thr 
140 


ATA 
He 


CGA 
Arg 


CGC 
Arg 


CAC 
His 


432 


* 

GTC 
Val 
145 


GAT TTG CTC GTT 
Asp Leu Leu Val 


GGG GCG 
Gly Ala 
150 


GCT 
Ala 


GCT 
Ala 


TTC 
Phe 


TGT 
Cys 
155 


TCC 
Ser 


GCT 
Ala 


ATG 
Met 


TAC 
Tyr 


GTG 
Val 
160 


480 


GGG 
Gly 


GAC CTC TGC 
Asp Leu Cys 


GGA 
Gly 
165 


TCT 
Ser 


GTC 
Val 


TTC 
Phe 


CTC 
Leu 


GTC 
Val 
170 


TCC 
Ser 


CAG 
Gin 


CTG 
Leu 


TTC 
Phe 


ACC 
Thr 
175 


ATC 
He 


528 


TCG 
Ser 


CCT CGC CGG CAT 
Pro Arg Arg His 
180 


GAG 
Glu 


ACG 
Thr 


GTG CAG GAC 
Val Gin Asp 
185 


TGC 
Cys 


AAT 
Asn 


TGC 
Cys 


TCA 
Ser 
190 


ATC 
He 


TAT 
Tyr 


576 


CCC 
Pro 


GGC CAC 
Gly His 
195 


ATA 
He 


ACG 
Thr 


GGT CAC 
Gly His 


CGT ATG 
Arg Met 
200 


GCT 
Ala 


TGG 
Trp 


GAT 
Asp 


ATG 
Met 
205 


ATG 
Met 


ATG 
Met 


AAC 
Asn 


624 


TGG 
Trp 


TCG CCT 
Ser Pro 
210 


ACA 
Thr 


ACG 
Thr 


GCC 
Ala 


CTG 
Leu 
215 


GTG 
Val 


GTA 
Val 


TCG 
Ser 


CAG 
Gin 


CTG 
Leu 
220 


CTC 
Leu 


CGG 
Arg 


ATC 
He 


CCA 
Pro 


672 



m % 



CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
225 230 235 240 

GGT CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG ATT 768 
Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu He 
245 250 255 

GTG ATG CTA CTC TTT GCT CCC TAATAG 795 
Val Met Leu Leu Phe Ala Pro 
260 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

©Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
CJ 1 5 10 15 

r^Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
^ 20 25 30 

l ~^Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
H 35 40 45 

flThr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

;S 50 55 60 



^ieu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
^ 65 70 75 80 

m 

SlSer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 

85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

Val Asd Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 



HOVENV,-T 



Pro Gly His He Thr Gly His Arg 
195 200 

Trp Ser Pro Thr Thr Ala Leu Val 
210 215 

Gin Ala Val Val Asp Met Val Ala 
225 230 

Gly Leu Ala Tyr Tyr Ser Met Val 
245 

Val Met Leu Leu Phe Ala Pro 
260 



Met Ala Trp Asp Met Met Met Asn 
205 

Val Ser Gin Leu Leu Arg He Pro 
220 

Gly Ala His Trp Gly Val Leu Ala 
235 240 

Gly Asn Trp Ala Lys Val Leu He 
250 255 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..630 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..627 



H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

15 10 15 

ATG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGT GCT GCC AGA 96 

Met Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 144 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTG ACC ATT CCA GCT TCC GCT TAT GAG GTG CGC AAC GTG 240 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 



"PA/ENV.-T 



TCC 
Ser 


GGG 
Gly 


ATG 
Met 


TAC 
Tyr 


CAT 
His 
85 


GTC 
Val 


ACG 
Thr 


AAC 
Asn 


GAC TGC 
Asp Cys 
90 


TCC 
Ser 


AAC 
Asn 


TCA 
Ser 


AGC 
Ser 


ATT 
He 
95 


GTG 
Val 


288 


tat 
Tyr 


Glu 


GCA 

Ala 


GCG 

Ala 
100 


Asp 


ATG 
Met 


ATC 
He 


ATG 
Met 


CAC 
His 
105 


ACC 
Thr 


CCC GGG TGC 
Pro Gly Cys 


GTG 
Val 
110 


CCC 
Pro 


TGC 
Cys 


336 


GTT 

Val 


Arg 


GAG 

Glu 
115 


AAP 

Asn 


aap 
Asn 


TPT 

Ser 


TCC 
Ser 


CGC 
Arg 
120 


TGC 
Cys 


TGG 
Trp 


GTA 
Val 


GCG 
Ala 


CTC 
Leu 
125 


ACC 
Thr 


CCC 
Pro 


ACG 
Thr 


384 


Leu 


Ala 
130 


1 

Ala 


Arg 


aap 
Asn 


GPP 

Ala 


Ser 
135 


GTP 

Val 


CCC 
Pro 


ACT 
Thr 


ACG 
Thr 


ACA 
Thr 
140 


ATA 
He 


PGA 

Arg 


CGC 
Arg 


CAC 
His 




Val 
145 


bni 

Asp 


TTfZ 

Leu 


PTP 

Leu 


Val 


Ciriri 

Gly 
150 


Cirri 

Ala 


GPT 

Ala 


GCT 
Ala 


TTC 
Phe 


TGT 
Cys 
155 


TCC 
Ser 


GCT 
Ala 


ATG 
Met 


TAP 
Tyr 


GTG 
Val 
160 


480 


Gly 


GAT 

Asp 


CTC 
Leu 


TGC 
Cys 


GGA 

Gly 
165 


TCT 

1 V i 

Ser 


GTC 
Val 


TTC 
Phe 


CTC 
Leu 


GTC 
Val 
170 


TCC 
Ser 


CAG 
Gin 


CTG 
Leu 


TTC 
Phe 


ACC 
Thr 
175 


ATC 
He 


528 


frrpG 


CCT 
Pro 


CGC 
Arg 


CGG 
Arg 
180 


CAT 
His 


GAG 
Glu 


ACG 
Thr 


GTG 
Val 


CAG 
Gin 
185 


GAC 
Asp 


TGC 
Cys 


AAT TGC 
Asn Cys 


TCA 
Ser 
190 


ATC 
He 


TAT 
Tyr 


576 


Cpcc 


GGC 
Gly 


CAC 
His 
195 


ATA 
He 


ACA 
Thr 


GGT 
Gly 


CAC 
His 


CGT 
Arg 
200 


ATG 
Met 


GCT 
Ala 


TGG GAT 
Trp Asp 


ATG 
Met 
205 


ATG 
Met 


ATG 
Met 


AAC 
Asn 


624 


= TGG 
QTrp 


TAATAG 
210 




























633 


S {2) 


INFORMATION 


FOR 


SEQ 


ID NO: 8: 




















(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 


















(ii) 


MOLECULE TYPE: 


protein 






















(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 8: 














Met 
i 


Leu Gly 


Lys 


Val 
5 


He Asp 


Thr 


Leu 


Thr Cys 
10 


Gly Phe 


Ala 


Asp 
15 


Leu 




Met 


Gly Tyr 


He 
20 


Pro 


Leu 


Val 


Gly 


Ala 
25 


Pro Leu Gly Gly 


Ala 
30 


Ala 


Arg 




Ala 


Leu 


Ala 
35 


His Gly Val Arg 


Val 
40 


Leu 


Glu Asp Gly Val 
45 


Asn 


Tyr 


Ala 




Thr 


Gly Asn 


Leu Pro Gly Cys 


Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala 


Leu 





50 55 60 
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% 



Leu Ser Cys Leu Thr lie Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

!f!Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
C3 195 200 205 

ftjPrp 



;j(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

3 (A) LENGTH: 483 base pairs 
□ (B) TYPE: nucleic acid 

T (C) STRANDEDNESS: single 

/; (D) TOPOLOGY: linear 

i 

4 (ii) MOLECULE TYPE: cDNA 
: . (iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..480 



(ix) FEATURE: 

(A) NAME/KEY: matjpeptide 

(B) LOCATION: 1. .477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

ATG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCC CTG CTG TCC TGT 48 
Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
15 10 15 



CTG ACC ATA CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG TCC GGG GTG 



96 





Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATA GTG TAT GAG GCA 14 4 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC GTT CGG GAG 192 
Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

GGC AAC TCC TCC CGT TGC TGG GTG GCG CTC ACT CCC ACG CTC GCG GCC 240 
Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 

AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 
Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT CTC 336 
Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 38 4 

Ocys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 

5 115 120 125 

ScGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 4 32 

l^Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
Y* 130 135 140 

WGTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TCC TAATAG 483 

SlVal Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser 

s 145 150 155 160 

5*7(2) INFORMATION FOR SEQ ID NO: 10: 

^ (i) SEQUENCE CHARACTERISTICS: 

y3 (A) LENGTH: 159 amino acids 

NJ (B) TYPE: amino acid 

r . (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
1 • 5 10 15 

Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
20 25 30 

Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
35 40 45 

Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 
50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 
65 70 75 80 
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Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



1 (ix) FEATURE: 

(A) NAME /KEY: CDS 

a (B) LOCATION: 1..477 

2 (ix) FEATURE: 

Z ( A ) NAME /KEY: mat_peptide 

(B) LOCATION: 1..474 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



ATG TCC GGT TGC TCT 


TTC 


TCT 


ATC 


TTC 


CTC 


TTG 


GCC CTG CTG TCC TGT 


48 


Met Ser Gly Cys Ser 


Phe 


Ser 


He 


Phe 


Leu 


Leu 


Ala Leu Leu Ser Cys 




1 5 










10 




15 




jCTG ACC ATA CCA GCT 


TCC 


GCT 


TAT 


GAA 


GTG 


CGC 


AAC GTG TCC GGG GTG 


96 


( Leu Thr He Pro Ala 


Ser 


Ala 


Tyr 


Glu 


Val 


Arg 


Asn Val Ser Gly Val 




; 20 






25 






30 




TAC CAT GTC ACG AAC 


GAC 


TGC 


TCC 


AAC 


TCA 


AGC 


ATA GTG TAT GAG GCA 


144 


Tyr His Val Thr Asn 


Asp 


Cys 


Ser 


Asn 


Ser 


Ser 


He Val Tyr Glu Ala 




35 






40 








45 




GCG GAC ATG ATC ATG 


CAC 


ACC 


CCC 


GGG 


TGC 


GTG 


CCC TGC GTT CGG GAG 


192 


Ala Asp Met He Met 


His 


Thr 


Pro 


Gly 


Cys 


Val 


Pro Cys Val Arg Glu 




50 




55 










60 




GGC AAC TCC TCC CGT 


TGC 


TGG 


GTG 


GCG 


CTC 


ACT 


CCC ACG CTC GCG GCC 


240 


Gly Asn Ser Ser Arg 


Cys 


Trp 


Val 


Ala 


Leu 


Thr 


Pro Thr Leu Ala Ala 




65 


70 










75 


80 
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AGG AAC GCC AGC GTC CCC ACA ACG ACA ATA CGA CGC CAC GTC GAT TTG 288 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

CTC GTT GGG GCT GCT GCT TTC TGT TCC GCT ATG TAC GTG GGG GAT GTC 336 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 

100 105 110 

TGC GGA TCT GTT TTC CTT GTT TCC CAG CTG TTC ACC TTC TCA CCT CGC 384 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

CGG CAT CAA ACA GTA CAG GAC TGC AAC TGC TCA ATC TAT CCC GGC CAT 432 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

GTA TCA GGT CAC CGC ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 480 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 



(2) INFORMATION FOR SEQ ID NO: 12: 

Q (i) SEQUENCE CHARACTERISTICS: 
W (A) LENGTH: 158 amino acids 

yp (B) TYPE: amino acid 

m (D) TOPOLOGY: linear 

!ft (ii) MOLECULE TYPE: protein 

W (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Wet Ser Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Leu Ser Cys 
□ 15 10 15 

;|eu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val Ser Gly Val 
*Z 20 25 30 

%z His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr Glu Ala 
^ 35 40 45 

Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys Val Arg Glu 

50 55 60 

Gly Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr Leu Ala Ala 

65 70 75 80 

Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His Val Asp Leu 
85 90 95 

Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val Gly Asp Leu 
100 105 110 

Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr Phe Ser Pro Arg 
115 120 125 

Arg His Gin Thr Val Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His 
130 135 140 

Val Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
145 150 155 



% 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



{ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..633 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1. .630 



03 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

0&TG CTG GGT AAG GCC ATC GAT ACC CTT ACG TGC GGC TTC GCC GAC CTC 48 

Cllet Leu Gly Lys Ala He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

it 1 5 10 15 

N3TG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 

sVal Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

O 20 25 30 

y3CC CTG GCG CAT GGC GTC CGG GTT CTG GAA GAC GGC GTG AAC TAT GCA 14 4 

IjVla Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

l Z 35 40 45 

*jy 

S&CA GGG AAT TTG CCT GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTA 192 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTA ACC ATT CCA GCT TCC GCT TAC GAG GTG CGC AAC GTG 240 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 

85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

115 120 125 

CTC GCG GCT AGG AAC GCC AGC ATC CCC ACT ACA ACA ATA CGA CGC CAC 432 
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Leu Ala Ala Arg Asn Ala Ser lie Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 480 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

GGG GAT CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 528 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 576 
Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

TGG TAC TAATAG 640 
Trp Tyr 
210 



Oy (2) INFORMATION FOR SEQ ID NO: 14: 

m (i) SEQUENCE CHARACTERISTICS: 
m (A) LENGTH: 210 amino acids 

C§ (B) TYPE: amino acid 

<2 (D) TOPOLOGY: linear 

N ! (ii) MOLECULE TYPE: protein 

Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

CI Met Leu Gly Lys Ala He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

in i 5 10 15 

^ Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
N 20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr He Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

V' 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

Leu Ala Ala Arg Asn Ala Ser He Pro Thr Thr Thr He Arg Arg His 
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130 135 140 

Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 

Glv Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
180 185 190 

Pro Gly His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 

Trp Tyr 
210 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



D (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

iZ ATGCCCGGTT GCTCTTTCTC TATCTT 

HJ (2) INFORMATION FOR SEQ ID NO: 16: 

Sj (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
ATGTTGGGTA AGGTCATCGA TACCCT 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
CTATTAGGAC CAGTTCATCA TCATATCCCA 30 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

1 (ii) MOLECULE TYPE: cDNA 
) (iii) HYPOTHETICAL: NO 
\ (iii) ANTI-SENSE: YES 



3 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

JJCTATTACCAG TTCATCATCA TATCCCA 27 
J (2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

ATACGACGCC ACGTCGATTC CCAGCTGTTC ACCATC 36 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
GATGGTGAAC AGCTGGGAAT CGACGTGGCG TCGTAT 36 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 723 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

3 (ii) MOLECULE TYPE: cDNA 

0 (iii) HYPOTHETICAL: NO 
g (iii) ANTI-SENSE : NO 

^ (ix) FEATURE: 

'4 (A) NAME /KEY: CDS 

(B) LOCATION: 1..720 

1 (ix) FEATURE: 

"J. (A) NAME /KEY : mat_peptide 

Z (B) LOCATION: 1..717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC .GAC CTC 48 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 
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TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



288 



TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 4 32 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 4 80 

Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

QATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 57 6 

i£ H Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
C| 180 185 190 

5^ GTA TCG CAG CTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG 624 

r?Val Ser Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala 

"2 195 200 205 

In 

MgGG GCC CAT TGG GGA GTC CTG GCG GGT CTC GCC TAC TAT TCC ATG GTG 672 
a Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val 
□ 210 215 220 

f^GGG AAC TGG GCT AAG GTT TTG ATT GTG ATG CTA CTC TTT GCT CCC TAATAG 723 

!^ Gly Asn Trp Ala Lys Val Leu He Val Met Leu Leu Phe Ala Pro 

W 2 25 230 235 240 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 239 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 
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Thr 

Leu 
65 


Gly 
50 
Ser 


Asn 
Cys 


Leu 
Leu 


Pro 
Thr 


Gly 

Val 
70 


Cys 
55 
Pro 


Ser Phe Ser 
Ala Ser Ala 


He 

Tyr 
75 


Phe Leu Leu Ala 
60 

Glu Val Arg Asn 


Leu 

Val 
80 


Ser 


Gly 


Met 


Tyr 


His 
85 


Val 


Thr 


Asn Asp Cys 
90 


Ser 


Asn Ser Ser He 
95 


Val 


Tyr 


Glu 


Ala 


Ala 
100 


Asp 


Met 


He 


Met His Thr 
105 


Pro 


Gly Cys Val Pro 
110 


Cys 


Val 


Arg 


Glu 
115 


Asn 


Asn 


Ser 


Ser 


Arg Cys Trp Val 
120 


Ala Leu Thr Pro 
125 


Thr 


Leu 


Ala 
130 


Ala 


Arg 


Asn 


Ala 


Ser 
135 


Val Pro Thr 


Thr 


Thr He Arg Arg 
140 


His 


Val 
145 


Asp 


Ser 


Gin 


Leu 


Phe 
150 


Thr 


He Ser Pro Arg 
155 


Arg His Glu Thr 


Val 
160 


Gin 


Asp 


Cys 


Asn 


Cys 
165 


Ser 


He 


Tyr Pro Gly His 
170 


lie Thr Gly His 
. 175 


Arg 


□Met 


Ala 


Trp 


Asp 
180 


Met 


Met 


Met 


Asn Trp Ser 
185 


Pro 


Thr Thr Ala Leu 
190 


Val 




Ser 


Gin 
195 


Leu 


Leu 


Arg 


He 


Pro Gin Ala 
200 


Val 


Val Asp Met Val 
205 


Ala 


\|Gly 


Ala 
210 


His 


Trp 


Gly 


Val 


Leu 
215 


Ala Gly Leu 


Ala 


Tyr Tyr Ser Met 
220 


Val 


"'Gly 
1,225 


Asn 


Trp 


Ala 


Lys 


Val 
230 


Leu 


He Val Met 


Leu 
235 


Leu Phe Ala Pro 




|^(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 23: 








ffl 
% 


(i) 


SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 561 base pairs 

(B) TYPE: nucleic acid 









(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .558 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1. .555 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

15 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 14 4 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 

-TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 

Q 85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 

21 Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 

^ 100 105 110 

SJgTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 

yl Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 

%J 115 120 125 

jU CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 432 

Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 

130 135 140 

f|| GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 480 

|fi Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 

£j 145 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 

Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 

165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TAATAG 561 

Met Ala Trp Asp Met Met Met Asn Trp 

180 185 



(2) INFORMATION FOR SEQ ID NO: 24: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 185 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 



15 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 . 125 

^eu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

^al Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
fy.45 150 155 160 



S£ln Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
in 165 170 175 

feet Ala Trp Asp Met Met Met Asn Trp 

!L 180 185 

M<2) INFORMATION FOR SEQ ID NO: 25: 

<fi (i) SEQUENCE CHARACTERISTICS: 

£l (A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. . 603 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..600 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 



ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 48 
Met Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 96 
Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 144 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 192 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 240 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 288 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

ISTAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 336 
^Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 HO 

fiOGTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 384 
Mval Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
Iff 115 120 125 

' 4 CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 4 32 

1-Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
W 130 135 140 

MeGTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 4 80 

pi Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
ll45 150 155 160 

?!CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 528 
^ J Gln Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 57 6 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 



180 185 190 

GTA TCG CAG CTG CTC CGG ATC CTC TAATAG 
Val Ser Gin Leu Leu Arg He Leu 
195 200 



606 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



Met 
1 


Leu 


Gly 


Lys 


Val 
5 


He 


Asp 


Thr 


Leu 


Thr Cys Gly 
10 


Phe Ala Asp Leu 
15 


Val 


Gly 


Tyr 


He 
20 


Pro 


Leu 


Val 


Gly 


Ala 
25 


Pro Leu Gly 


Gly Ala Ala Arg 
30 


Ala 


Leu 


Ala 
35 


His 


Gly 


Val 


Arg 


Val 
40 


Leu 


Glu Asp Gly 


Val Asn Tyr Ala 
45 


Thr 


Gly 
50 


Asn 


Leu 


Pro 


Gly 


Cys 
55 


Ser 


Phe 


Ser He Phe 
60 


Leu Leu Ala Leu 


Leu 
65 


Ser 


Cys 


Leu 


Thr 


Val 
70 


Pro 


Ala 


Ser 


Ala Tyr Glu 
75 


Val Arg Asn Val 
80 


Ser 


Gly 


Met 


Tyr 


His 
85 


Val 


Thr 


Asn 


Asp 


Cys Ser Asn 
90 


Ser Ser He Val 
95 


Tyr 


Glu 


Ala 


Ala 
100 


Asp 


Met 


He 


Met 


His 
105 


Thr Pro Gly 


Cys Val Pro Cys 
110 


,«yal 


Arg 


Glu 
115 


Asn 


Asn 


Ser 


Ser 


Arg 
120 


Cys 


Trp Val Ala 


Leu Thr Pro Thr 
125 


tfLeu 


Ala 
130 


Ala 


Arg 


Asn 


Ala 


Ser 
135 


Val 


Pro 


Thr Thr Thr 
140 


He Arg Arg His 


Cval 
iA45 


Asp 


Ser 


Gin 


Leu 


Phe 
150 


Thr 


He 


Ser 


Pro Arg Arg 
155 


His Glu Thr Val 
160 




Asp 


Cys 


Asn 


Cys 
165 


Ser 


He 


Tyr 


Pro 


Gly His He 
170 


Thr Gly His Arg 
175 




Ala 


Trp 


Asp 
180 


Met 


Met 


Met 


Asn 


Trp 
185 


Ser Pro Thr 


Thr Ala Leu Val 
190 


'3fal 


Ser 


Gin 
195 


Leu 


Leu 


Arg 


He 


Leu 
200 









(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 636 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..633 



(ix) FEATURE: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 1..630 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

ATG TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 
Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

GTG GGG TAG ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 
Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 
Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 
Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

;CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 
Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGO ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 

TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 

CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 

GTC GAT TCC CAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG 
Val Asp Ser Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val 
145 * 150 155 160 

CAG GAC TGC AAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT 
Gin Asp Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg 
165 170 175 

ATG GCT TGG GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG 
Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

GTA TCG CAG CTG CTC CGG ATC GTG ATC GAG GGC AGA CAC CAT CAC CAC 
Val Ser Gin Leu Leu Arg lie Val He Glu Gly Arg His His His His 
195 200 205 

CAT CAC TAATAG 
His His 



210 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Met Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
15 10 15 

Val Gly Tyr lie Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

OThr Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu 
gg 50 55 60 

S^eu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
]H 65 70 75 80 

SSer Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 

Lfl 85 90 95 

./Tyr Glu Ala Ala Asp Met lie Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 

■%al Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
H 115 120 125 

I y 

\SLeu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr lie Arg Arg His 
Cf 130 135 140 

Val Asp Ser Gin Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val 
145 150 155 160 

Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His lie Thr Gly His Arg 
165 170 175 

Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val 
180 185 190 

Val Ser Gin Leu Leu Arg He Val He Glu Gly Arg His His His His 
195 200 205 

; " His His 
210 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 



• ♦ 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 

(A) NAME /KEY: matjpeptide 

(B) LOCATION: 1..624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

_ATG GGT AAG GTC ATC GAT ACC CTT ACG TGC GGA TTC GCC GAT. CTC ATG 48 

ClMet Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asu Leu Met 

pi 1 5 10 15 

^11 

mGGG TAC ATC CCG CTC GTC GGC GCT CCC GTA GGA GGC GTC GCA AGA GCC 96 

^Gly Tyr lie Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 

20 25 30 

pfCTT GCG CAT GGC GTG AGG GCC CTT GAA GAC GGG ATA AAT TTC GCA ACA 14 4 

Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly lie Asn Phe Ala Thr 

s 35 40 45 



GGG AAT TTG CCC GGT TGC TCC TTT TCT ATT TTC CTT CTC GCT CTG TTC 192 

Gly Asn Leu Pro Gly Cys Ser Phe Ser lie Phe Leu Leu Ala Leu Phe 

l m 50 55 60 

3 TCT TGC TTA ATT CAT CCA GCA GCT AGT CTA GAG TGG CGG AAT ACG TCT 240 

SI Ser Cys Leu lie His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 

65 70 75 80 

GGC CTC TAT GTC CTT ACC AAC GAC TGT TCC AAT AGC AGT ATT GTG TAC ' 288 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser lie Val Tyr 

85 90 95 

GAG GCC GAT GAC GTT ATT CTG CAC ACA CCC GGC TGC ATA CCT TGT GTC 336 

Glu Ala Asp Asp Val lie Leu His Thr Pro Gly Cys lie Pro Cys Val 

100 105 110 

CAG GAC GGC AAT ACA TCC ACG TGC TGG ACC CCA GTG ACA CCT ACA GTG 38 4 

Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 

115 120 125 

GCA GTC AAG TAC GTC GGA GCA ACC ACC GCT TCG ATA CGC AGT CAT GTG 432 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 

130 135 140 

GAC CTA TTA GTG GGC GCG GCC ACG ATG TGC TCT GCG CTC TAC GTG GGT 480 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 

145 150 155 160 



GAC ATG TGT GGG GCT GTC TTC CTC GTG GGA CAA GCC TTC ACG TTC AGA 528 
Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 

CCT CGT CGC CAT CAA ACG GTC CAG ACC TGT AAC TGC TCG CTG TAC CCA 576 
Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

GGC CAT CTT TCA GGA CAT CGA ATG GCT TGG GAT ATG ATG ATG AAC TGG 624 
Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 

TAATAG 634 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fn (ii) MOLECULE TYPE: protein 

jS (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

^Met Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
1 5 10 15 

Hi Gly Tyr He Pro Leu Val Gly Ala Pro Val Gly Gly Val Ala Arg Ala 
20 25 30 

"5 Leu Ala His Gly Val Arg Ala Leu Glu Asp Gly He Asn Phe Ala Thr 
'f* 35 40 45 

m Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu Phe 
y3 50 55 60 

Ser Cys Leu He His Pro Ala Ala Ser Leu Glu Trp Arg Asn Thr Ser 
65 70 75 80 

Gly Leu Tyr Val Leu Thr Asn Asp Cys Ser Asn Ser Ser He Val Tyr 
85 90 95 

Glu Ala Asp Asp Val He Leu His Thr Pro Gly Cys He Pro Cys Val 
100 105 110 

Gin Asp Gly Asn Thr Ser Thr Cys Trp Thr Pro Val Thr Pro Thr Val 
115 120 125 

Ala Val Lys Tyr Val Gly Ala Thr Thr Ala Ser He Arg Ser His Val 
130 135 140 

Asp Leu Leu Val Gly Ala Ala Thr Met Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Met Cys Gly Ala Val Phe Leu Val Gly Gin Ala Phe Thr Phe Arg 
165 170 175 




Pro Arg Arg His Gin Thr Val Gin Thr Cys Asn Cys Ser Leu Tyr Pro 
180 185 190 

Gly His Leu Ser Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 630 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1..627 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..624 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

HVTG GGT AAG GTC ATC GAT ACC CTA ACG TGC GGA TTC GCC GAT CTC ATG 48 
FlKet Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 

m 1 5 10 15 

ISGG TAT ATC CCG CTC GTA GGC GGC CCC ATT GGG GGC GTC GCA AGG GCT 96 
Gl'y Tyr He Pro Leu Val Gly Gly Pro He Gly Gly Val Ala Arg Ala 
20 25 30 

CTC GCA CAC GGT GTG AGG GTC CTT GAG GAC GGG GTA AAC TAT GCA ACA 144 
Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

GGG AAT TTA CCC GGT TGC TCT TTC TCT ATC TTT ATT CTT GCT CTT CTC 192 
Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

TCG TGT CTG ACC GTT CCG GCC TCT GCA GTT CCC TAC CGA AAT GCC TCT 240 
Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 

GGG ATT TAT CAT GTT ACC AAT GAT TGC CCA AAC TCT TCC ATA GTC TAT 288 
Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 

GAG GCA GAT AAC CTG ATC CTA CAC GCA CCT GGT TGC GTG CCT TGT GTC 336 



Glu Ala Asp Asn Leu lie Leu His Ala Pro Gly Cys Val Pro Cys Vai 
100 105 110 

ATG ACA GGT AAT GTG AGT AGA TGC TGG GTC CAA ATT ACC CCT ACA CTG 384 
Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin He Thr Pro Thr Leu 
115 120 125 

TCA GCC CCG AGC CTC GGA GCA GTC ACG GCT CCT CTT CGG AGA GCC GTT 432 
Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

GAC TAC CTA GCG GGA GGG GCT GCC CTC TGC TCC GCG TTA TAC GTA GGA 480 
Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

GAC GCG TGT GGG GCA CTA TTC TTG GTA GGC CAA ATG TTC ACC TAT AGG 528 
Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

CCT CGC CAG CAC GCT ACG GTG CAG AAC TGC AAC TGT TCC ATT TAC AGT 57 6 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 
180 185 190 

fGGC CAT GTT ACC GGC CAC CGG ATG GCA TGG GAT ATG ATG ATG AAC TGG 624 

SSly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
"I 195 200 205 

rtfAATAG 630 



: (2) INFORMATION FOR SEQ ID NO; 32: 

M (i) SEQUENCE CHARACTERISTICS: 
yQ (A) LENGTH: 208 amino acids 

U (B) TYPE: amino acid 

hj (D) TOPOLOGY: linear 

H (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Met Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Met 
15 10 15 

Gly Tyr He Pro Leu Vai Gly Gly Pro lie Gly Gly Val Ala Arg Ala 
20 25 30 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala Thr 
35 40 45 

Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe He Leu Ala Leu Leu 
50 55 60 

Ser Cys Leu Thr Val Pro Ala Ser Ala Val Pro Tyr Arg Asn Ala Ser 
65 70 75 80 



Gly He Tyr His Val Thr Asn Asp Cys Pro Asn Ser Ser He Val Tyr 
85 90 95 



Glu Ala Asp Asn Leu He Leu His Ala Pro Gly Cys Val Pro Cys Val 
100 105 110 

Met Thr Gly Asn Val Ser Arg Cys Trp Val Gin He Thr Pro Thr Leu 
115 120 125 

Ser Ala Pro Ser Leu Gly Ala Val Thr Ala Pro Leu Arg Arg Ala Val 
130 135 140 

Asp Tyr Leu Ala Gly Gly Ala Ala Leu Cys Ser Ala Leu Tyr Val Gly 
145 150 155 160 

Asp Ala Cys Gly Ala Leu Phe Leu Val Gly Gin Met Phe Thr Tyr Arg 
165 170 175 

Pro Arg Gin His Ala Thr Val Gin Asn Cys Asn Cys Ser He Tyr Ser 
180 185 190 

Gly His Val Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 
195 200 205 



;^{2} INFORMATION FOR SEQ ID NO: 33: 

W (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 23 base pairs 

H (B) TYPE: nucleic acid . 

1H (C) STRANDEDNESS: single 

y (D) TOPOLOGY: linear 

jU (ii) MOLECULE TYPE: cDNA 

;Q (iii) HYPOTHETICAL: NO 

fy (iii) ANTI-SENSE: NO 



' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TGGGATATGA TGATGAACTG GTC 23 

(2) INFORMATION FOR SEQ ID NO: 34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



CTATTATGGT GGTAAGCCAC AGAGCAGGAG 30 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1. .1473 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1. .1470 



PJ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 



^GG 


GAT ATG 


ATG 


ATG 


AAC 


TGG 


TCG 


CCT 


ACA 


ACG 


GCC 


CTG 


GTG GTA TCG 


48 




Asp Met 


Met 


Met 


Asn 


Trp 


Ser 


Pro 


Thr 


Thr 


Ala 


Leu 


Val Val Ser 




%.i 1 






5 










10 








15 




:cag 


CTG CTC 


CGG 


ATC 


CCA 


CAA 


GCT 


GTC 


GTG 


GAC 


ATG 


GTG 


GCG GGG GCC 


96 




Leu Leu 


Arg 


He 


Pro 


Gin 


Ala 


Val 


Val 


Asp 


Met 


Val 


Ala Gly Ala 








20 










25 










30 




pSAT 


TGG GGA 


GTC 


CTG 


GCG 


GGC 


CTC 


GCC 


TAC 


TAT 


TCC 


ATG 


GTG GGG AAC 


144 


!Iis 


Trp Gly 


Val 


Leu 


Ala 


Gly 


Leu 


Ala 


Tyr 


Tyr 


Ser 


Met 


Val Gly Asn 






35 










40 










45 








GCT AAG 


GTT 


TTG 


GTT 


GTG 


ATG 


CTA 


CTC 


TTT 


GCC 


GGC 


GTC GAC GGG 


192 


Trp 


Ala Lys 


Val 


Leu 


Val 


Val 


Met 


Leu 


Leu 


Phe 


Ala Gly 


Val Asp Gly 






50 








55 










60 








LCAT 


ACC CGC 


GTG 


TCA 


GGA 


GGG 


GCA 


GCA 


GCC 


TCC 


GAT 


ACC 


AGG GGC CTT 


240 


'His 


Thr Arg 


Val 


Ser 


Gly 


Gly 


Ala 


Ala 


Ala 


Ser 


Asp Thr 


Arg Gly Leu 




65 








70 










75 






80 




GTG 


TCC CTC 


TTT 


AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG 


CTC GTA AAC 


288 


Val 


Ser Leu 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin 


Leu Val Asn 










85 










90 








95 




ACC 


AAC GGC 


AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC 


TGC AAC GAC 


336 


Thr 


Asn Gly 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys Asn Asp 








100 










105 










110 




TCC 


CTC CAA 


ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA 


CAC AAA TTC 


384 


Ser 


Leu Gin 


Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr Lys 


His Lys Phe 






115 










120 










125 






AAC 


TCG TCT 


GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC 


TCC ATC GAC 


432 



% • 



Asn 


Ser 


Ser 


Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


He 


Asp 






130 










135 










140 












AAG 


TTC 


GCT 


CAG 


GGG 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


480 


Lys 


Phe 


Ala 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 




145 










150 










155 










160 




TCG 


GAC 


CAG 


AGG 


CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


528 


Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 












165 










170 










175 






ATT 


GTA 


CCC 


GCG 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


576 


He 


Val 


Pro 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 










180 










IOC 

185 










190 








AGC 


CCT 


GTT 


GTG 


GTG 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


624 


Ser 


Pro 


Val 


Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 








195 










200 










205 










AAC 


TGG 


GGG 


GCG 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


672 


Asn 


Trp 


Gly 


Ala 


Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 






210 










215 










220 












pCCG 


CCG 


CGA 


GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


720 


Sfro 


Pro 


Arg 


Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Gly 


Thr 


Gly 














230 










235 










240 




fl?TC 


ACC 


AAG 


ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


768 


fflPhe 


Thr 


Lys 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 












245 










250 










255 






m 

r AAC 


AAC 


ACC 


TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


816 


%sn 


Asn 


Thr 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 










260 










265 










270 








Gbcc 


ACC 


TAC 


GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


864 




Thr 


Tyr 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 








**V ~T c 

275 










280 










285 










ill 


GTT 


CAT 


TAC 


CCA 


TAT 


AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


912 




Val 


His 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 






i rt A 

290 










*"i Ci c 

295 










300 












: <TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


960 


Phe 


Thr 


He 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 




305 










310 










315 










320 




TTC 


GAA 


GCC 


GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


1008 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 












325 










330 










335 








AGG 


GAT 


AGA 


TCA 


GAG 


CTT 


AGC 


CCG 


CTG 


CTG 


CTG 


TCT 


ACA 


ACA 


GAG 


1056 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 


Leu 


Ser 


Thr 


Thr 


Glu 










340 










345 










350 








TGG 


CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC 


CTA 


TCC 


ACC 


1104 


Trp 


Gin 


He 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro 


Ala 


Leu 


Ser 


Thr 






355 








360 










365 










GGC 


CTG 


ATC 


CAC 


CTC 


CAT 


CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA 


TAC 


CTG 


TAC 


1152 


Gly 


Leu 


He 


His 


Leu 


His 


Gin 


Asn 


He 


Val 


Asp 


Val 


Gin 


Tyr 


Leu 


Tyr 




370 










375 










380 













GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC AAA TGG GAG TAT GTC 1200 
Gly Val Gly Ser Ala Val Val Ser Leu Val lie Lys Trp Glu Tyr Val 
385 390 395 400 

CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC ATC TGC GCC TGC TTA 1248 
Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg lie Cys Ala Cys Leu 
405 410 415 

TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC GCC TTA GAG AAC CTG 1296 
Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG CAT GGC ACT CTT TCC 134 4 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
435 440 445 

TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC AAG GGC AGG CTG GTC 1392 
Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He Lys Gly Arg Leu Val 
450 455 460 

CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG CCG CTG CTC CTG CTT 14 4 0 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
e H*65 470 475 480 

SpTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 14 76 

"^MLeu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
III 485 490 

Sj(2) INFORMATION FOR SEQ ID NO: 36: 

t\ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 90 amino acids 
L (B) TYPE: amino acid 

M (D) TOPOLOGY: linear 

fS (ii) MOLECULE TYPE: protein 

% (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

>lTrp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser 
1 5 10 15 

Gin Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala 
20 25 30 

His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn 
35 40 45 

Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly 
50 55 60 

His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu 
65 70 75 80 

Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn 
85 90 95 

Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp 
100 105 110 
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435 440 445 

Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr lie Lys Gly Arg Leu Val 
450 455 460 

Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu 
465 470 475 480 

Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
485 490 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(BJ LOCATION:. 2. .1018 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 2,. 1015 



fl (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

fy G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

% lie Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
n 1 5 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 

TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC 238 
Phe Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr Asn Gly 
65 70 75 

AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC CAA 28 6 

Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin 
80 85 90 95 



% 



ACA GGG TTC TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT 334 

Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser 
100 105 110 

GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT 382 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys Phe Ala 
115 120 125 

CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG GAC CAG 430 

Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin 
130 135 140 

AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT GTA CCC 478 

Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro 

145 150 155 

GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC CCT GTT 526 

Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 

160 165 170 175 

GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC TGG GGG 57 4 

Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly 
180 185 190 

SsCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG CCG CGA 622 

"2JUa Asn Asd Ser Asd Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg 
^ ~ 195 * 200 205 

SSGGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC ACC AAG 670 

SJGly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys 
ir i 210 215 220 

~ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC AAC ACC 718 

a Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr 

Q 225 230 235 

keTTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC ACC TAC 766 

kjLeu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr 

[=240 245 250 255 

S| GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG GTT CAT 814 

Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His 
260 265 270 

TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC ACC ATC 8 62 

Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr He 
275 280 285 

TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC GAA GCC 910 

Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala 
290 295 300 

GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC AGG GAT 958 

Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 

305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG CAG AGT 1006 

Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser 

320 325 330 335 

GGC AGA GCT TAATTA 1021 
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Gly Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
15 10 15 

Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val 
20 25 30 

Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
35 40 45 

Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 
65 70 75 80 

Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
85 90 95 

Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 

Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr 
210 215 220 

Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu 
225 230 235 240 

Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala 



245 



250 



255 
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Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr 
260 265 270 

Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe 
275 280 285 

Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala 
290 295 300 

Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg 
305 310 315 320 

Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin Ser Gly 
325 330 335 



Arg Ala 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2. .1032 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 2. .1029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

G ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA 4 6 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly 
15 10 15 

GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG 94 
Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys 
t 20 25 30 

GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC 142 
Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg 
35 40 45 

GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC 190 
Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu 
50 55 60 
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Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe 
115 120 125 

Asn Ser'Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp 
130 135 140 

Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser 
145 150 155 160 

Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly 
165 170 175 

He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro 
180 185 190 

Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr 
195 200 205 

Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg 
210 215 220 

Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly 
225 230 235 240 

3>he Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly 
^ 245 250 255 

Piksn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu 

ffl 260 265 270 

;^Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys 
Zl 275 280 285 

s Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn 
O 290 295 300 

y^Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg 
m305 310 315 320 

^Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu 
"N 325 330 335 

Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu 
340 345 350 

i 

Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr 
355 360 365 

Gly Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr 
) 370 375 380 

Gly Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val 
385 390 395 400 

Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu 
405 410 415 

Trp Met Met Leu Leu He Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu 
420 425 430 

Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala His Gly Thr Leu Ser 
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TTT 


AGC 


CCC 


GGG 


TCG 


GCT 


CAG 


AAA 


ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


238 


Phe 


Ser 


Pro 


Gly 


Ser 


Ala 


Gin 


Lys 


He 


Gin 


Leu 


Val 


Asn 


Thr 


Asn 


Gly 






65 










70 








75 












AGT 


TGG 


CAC 


ATC 


AAC 


AGG 


ACT 


GCC 


CTG 


AAC 


TGC 


AAC 


GAC 


TCC 


CTC 


CAA 


286 


Ser 


Trp 


His 


He 


Asn 


Arg 


Thr 


Ala 


Leu 


Asn 


Cys 


Asn 


Asp 


Ser 


Leu 


Gin 




80 








85 










90 










95 




ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


TAC 


AAA 


CAC 


AAA 


TTC 


AAC 


TCG 


TCT 


334 


Thr 


Gly 


Phe 


Phe 


Ala 


Ala 


Leu 


Phe 


Tyr 


Lys 


His 


Lys 


Phe 


Asn 


Ser 


Ser 












100 










105 










110 






GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


TGT 


CGC 


TCC 


ATC 


GAC 


AAG 


TTC 


GCT 


382 


Gly 


Cys 


Pro 


Glu 


Arg 


Leu 


Ala 


Ser 


Cys 


Arg 


Ser 


lie 


Asp 


Lys 


Phe 


Ala 










115 










120 










125 








CAG 


GGG 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


ACT 


GAG 


CCT 


AAC 


AGC 


TCG 


GAC 


CAG 


430 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 


Thr 


Glu 


Pro 


Asn 


Ser 


Ser 


Asp 


Gin 








130 










135 










140 










AGG 


CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


CCC 


478 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 


Pro 


Arg 


Pro 


Cys 


Gly 


lie 


Val 


Pro 






145 










150 










155 












;gcg 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


526 


fAla 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 


Tyr 


Cys 


Phe 


Thr 


Pro 


Ser 


Pro 


Val 




fl60 










165 










170 










175 




^GTG 


GTG 


GGG 


ACG 


ACC 


GAT 


CGG 


TTT 


GGT 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


574 


^Val 


Val 


Gly 


Thr 


Thr 


Asp 


Arg 


Phe 


Gly 


Val 


Pro 


Thr 


Tyr 


Asn 


Trp 


Gly 












180 










185 










190 






F GCG 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


622 


: Ala 


Asn 


Asp 


Ser 


Asp 


Val 


Leu 


He 


Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arg 










195 










200 










205 








GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


670 


\Gly 


Asn 


Trp 


Phe 


Gly 


Cys 


Thr 


Trp 


Met 


Asn 


Gly 


Thr 


Gly 


Phe 


Thr 


Lys 






210 










215 










220 










=ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


718 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 


He 


Gly 


Gly 


Ala 


Gly 


Asn 


Asn 


Thr 






225 










230 










235 












TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


CGG 


AAG 


CAC 


CCC 


GAG 


GCC 


ACC 


TAC 


766 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 


Arg 


Lys 


His 


Pro 


Glu 


Ala 


Thr 


Tyr 




240 










245 










250 










255 




GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


CTG 


ACA 


CCT 


AGG 


TGT 


ATG 


GTT 


CAT 


814 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 


Leu 


Thr 


Pro 


Arg 


Cys 


Met 


Val 


His 












260 










265 










270 






TAC 


CCA 


TAT 


AGG 


CTC 


TGG 


CAC 


TAC 


CCC 


TGC 


ACT 


GTC 


AAC 


TTC 


ACC 


ATC 


862 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 


Pro 


Cys 


Thr 


Val 


Asn 


Phe 


Thr 


He 










275 










280 










285 








TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


GGC 


GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


910 


Phe 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 


Gly 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 








290 










295 










300 










GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


CGT 


TGT 


GAC 


TTG 


GAG 


GAC 


AGG 


GAT 


958 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 


Arg 


Cys 


Asp 


Leu 


Glu 


Asp 


Arg 


Asp 
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305 310 315 

AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GGT GAT CGA GGG 1006 
Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Gly Asp Arg Gly 
320 325 330 335 

CAG ACA CCA TCA CCA CCA TCA CTA AT AG 1034 
Gin Thr Pro Ser Pro Pro Ser Leu 
340 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His Trp Gly Val 
_ 1 5 10 15 

Seu Ala Gly Leu Ala Tyr Tyr Ser Met . Val Gly Asn Trp Ala Lys Val 
y3 20 25 30 

f|eu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val 
M 35 40 45 

^er Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe 
50 55 60 

C|er Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser 

y365 70 75 80 

Strp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr 
l t 85 90 95 

Slly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly 
100 105 110 

Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser He Asp Lys Phe Ala Gin 
115 120 125 

Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser Asp Gin Arg 
130 135 140 

Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He Val Pro Ala 
145 150 155 160 

Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val Val 
165 170 175 

Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn Trp Gly Ala 
180 185 190 

Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro Pro Arg Gly 
195 200 205 



Asn Trp Phe Gly Cys Thr Trp 
210 215 

Cys Gly Gly Pro Pro Cys Asn 
225 230 

Thr Cys Pro Thr Asp Cys Phe 
245 

Arg Cys Gly Ser Gly Pro Trp 
260 

Pro Tyr Arg Leu Trp His Tyr 
275 

Lys Val Arg Met Tyr Val Gly 
290 295 

Cys Asn Trp Thr Arg Gly Glu 
305 310 

Ser Glu Leu Ser Pro Leu Leu 
325 

^fjhr Pro Ser Pro Pro Ser Leu 

yy 340 



flj{2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 
't\ (A) LENGTH: 945 base pairs 

H (B) TYPE: nucleic acid 

5S (C) STRANDEDNESS: single 

O (D) TOPOLOGY: linear 

S (ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1..942 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 
(B> LOCATION: 1..939 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 
Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 



Met Asn Gly Thr 



Gly Phe Thr Lys Thr 
220 



He Gly Gly Ala 
235 



Gly Asn Asn Thr Leu 
240 



Arg Lys His Pro 
250 



Glu Ala Thr Tyr Ala 
255 



Leu Thr Pro Arg 
265 



Cys Met Val His Tyr 
270 



Pro Cys Thr Val 
280 



Asn Phe Thr He Phe 
285 



Gly Val Glu His 



Arg Phe Glu Ala Ala 
300 



Arg Cys Asp Leu 
315 



Glu Asp Arg Asp Arg 
320 



Leu Ser Thr Thr 
330 



Gly Asp Arg Gly Gin 
335 



GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 




• 



ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 
Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu 
50 55 60 

AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAG 240 
Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 288 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 336 
Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 384 
Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

jrffcGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 432 

u ^Azq Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
^3 130 135 140 

OJTGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 480 

SJCys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
yjl45 150 155 160 

'^GTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 528 
- Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
Q 165 170 175 

pAAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 57 6 

hsAsn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
^ 180 185 190 

% ^AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 624 
Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 

GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 672 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 720 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 768 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 816 
Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 864 



HCVENV.-T 



Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

TCT ACA ACA GAG TGG CAG AGC TTA ATT AAT TAG 945 

Ser Thr Thr Glu Trp Gin Ser Leu lie Asn 

305 310 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 314 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

PMet Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
Si 5 10 15 

Jfely Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asd 
^ 20 25 30 

Mrhr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
HI 35 40 45 

%j 

Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
~ n 50 55 60 

^Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
H 65 * 70 75 80 

C;Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
Cj 85 90 95 

Arg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
;* 130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 



% 



Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 
225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 

Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Glu Trp Gin Ser Leu He Asn 
305 310 

(2) INFORMATION FOR SEQ ID NO: 43: 

0Q (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 base pairs 
=r ; (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

HI (ii) MOLECULE TYPE: cDNA 
5 *' (iii) HYPOTHETICAL: NO 
y (iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..958 

(ix) FEATURE: 

(A) NAME /KEY : mat_peptide 

(B) LOCATION: 1..955 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC 48 
Met Val Gly Asn TrD Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
1 5 10 15 

GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT 96 
Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
20 25 30 

ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC 144 
Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He 
35 40 45 

CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG 192 



% 



Gin Leu Val Asn Thr Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu 
50 55 60 



AAC TGC AAC GAC TCC CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAG 
Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 



240 



AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT 
Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 



288 



CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT 
Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
100 105 110 



336 



GAG CCT AAC AGC TCG GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT 
Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 



384 



CGA CCG TGT GGT ATT GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT 
Arg Pro Cys Gly lie Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 



432 



TGC TTC ACC CCG AGC CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT 
^Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
^Ol45 150 155 160 



480 



fSGTC CCC ACG TAT AAC TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC 
SA Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asd Val Leu lie Leu 
in 165 170 * 175 



528 



^ AAC AAC ACG CGG CCG CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG 
s ^ Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
U 180 185 190 



576 



M= AAT GGC ACT GGG TTC ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC 
hi Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn lie" 
% 195 200 205 



624 



^ GGG GGG GCC GGC AAC AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG 
Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 
210 215 220 



672 



AAG CAC CCC GAG GCC ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG 
Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Glv Ser Gly Pro Trp Leu 
225 230 235 240 



720 



ACA CCT AGG TGT ATG GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC 
Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 



768 



TGC ACT GTC AAC TTC ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC 
Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 



816 



GTG GAG CAC AGG TTC GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT 
Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 



864 



TGT GAC TTG GAG GAC AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG 912 
Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 



290 



295 



300 



TCT ACA ACA GGT GAT CGA GGG CAG ACA CCA TCA CCA CCA TCA CTA A 

Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 

305 310 315 
TAG 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe Ala 
15 10 15 

J51y Val Aso Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp 
=3 * 20 25 30 

"IThr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie 
{j 35 40 45 

%ln Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala Leu 
^ 50 55 60 



4Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr 
65 70 75 80 

5Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys 
85 90 95 

JArg Ser He Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr 
2 100 105 110 

4 

Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro 
115 120 125 

Arg Pro Cys Gly He Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr 
130 135 140 

Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly 
145 150 155 160 

Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu 
165 170 175 

Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met 
180 185 190 

Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He 
195 200 205 

Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg 

210 215 220 

Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu 



225 230 235 240 

Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro 
245 250 255 

Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly Gly 
260 265 270 

Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg 
275 280 285 



Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu 
290 295 300 

Ser Thr Thr Gly Asp Arg Gly Gin Thr Pro Ser Pro Pro Ser Leu 
305 310 315 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1395 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..1392 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..1389 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



ATG 


GTG 


GCG 


GGG 


GCC 


CAT 


TGG 


GGA 


GTC 


CTG 


GCG 


GGC CTC GCC TAC TAT 


48 


Met 


Val 


Ala 


Gly 


Ala 


His 


Trp 


Gly 


Val 


Leu 


Ala 


Gly Leu Ala Tyr Tyr 




1 






5 










10 




15 




TCC 


ATG 


GTG 


GGG 


AAC 


TGG 


GCT 


AAG 


GTT 


TTG 


GTT 


GTG ATG CTA CTC TTT 


96 


Ser 


Met 


Val 


Gly 


Asn 


Trp 


Ala 


Lys 


Val 


Leu 


Val 


Val Met Leu Leu Phe 










20 








25 






30 




GCC 


GGC 


GTC 


GAC 


GGG 


CAT 


ACC 


CGC 


GTG 


TCA 


GGA 


GGG GCA GCA GCC TCC 


144 


Ala Gly Val 


Asp 


Gly 


His 


Thr 


Arg 


Val 


Ser 


Gly 


Gly Ala Ala Ala Ser 








35 










40 








45 




GAT 


ACC 


AGG 


GGC 


CTT 


GTG 


TCC 


CTC 


TTT 


AGC 


CCC 


GGG TCG GCT CAG AAA 


192 


Asp Thr Arg 


Gly 


Leu 


Val 


Ser 


Leu 


Phe 


Ser 


Pro 


Gly Ser Ala Gin Lys 






50 










55 










60 




ATC 


CAG 


CTC 


GTA 


AAC 


ACC 


AAC 


GGC 


AGT 


TGG 


CAC 


ATC AAC AGG ACT GCC 


240 


He 


Gin 


Leu 


Val 


Asn 


Thr 


Asn 


Gly 


Ser 


Trp 


His 


He Asn Arg Thr Ala 




65 










70 










75 


80 





% % 



CTG 


A a /** 

AAC 


TGC 


AAC 


GAC 


TCC 


CTC 


CAA 


ACA 


GGG 


TTC 


TTT 


GCC 


GCA 


CTA 


TTC 


288 


T «... 

Leu 


Asn 


Cys 


Asn 


Asp 


Ser 


Leu 


Gin 


Thr 


Gly 


rne 


rne 


Ala 


AT — 

Ala 


Leu 


Phe 












85 










90 










95 






TAC 


AAA 


CAC 


AAA 


TTC 


AAC 


TCG 


TCT 


GGA 


TGC 


CCA 


GAG 


CGC 


TTG 


GCC 


AGC 


336 


Tyr 


Lys 


His 


Lys 


Phe 


Asn 


Ser 


Ser 


Gly 


Cys 


Pro 


GlU 


Arg 


Leu 


Ala 


Ser 








100 










105 










110 








TGT 


CGC 


TCC 


ATC 


GAC 


AAG 


TTC 


GCT 


CAG 


GGG 


TGG 


GGT 


CCC 


CTC 


ACT 


TAC 


384 


Cys 


Arg 


Ser 


lie 


Asp 


Lys 


Phe 


Ala 


Gin 


Gly 


Trp 


Gly 


Pro 


Leu 


Thr 


Tyr 








115 










120 










125 










ACT 


GAG 


CCT 


AAC 


AGC 


TCG 


GAC 


CAG 


AGG 


CCC 


TAC 


TGC 


TGG 


CAC 


TAC 


GCG 


432 


Thr 


Glu 


Pro 


Asn 


Ser 


Ser 


Asp 


Gin 


Arg 


Pro 


Tyr 


Cys 


Trp 


His 


Tyr 


Ala 






130 










135 










140 












CCT 


CGA 


CCG 


TGT 


GGT 


ATT 


GTA 


CCC 


GCG 


TCT 


CAG 


GTG 


TGC 


GGT 


CCA 


GTG 


480 


Pro 


Arg 


Pro 


Cys 


Gly 


lie 


Val 


Pro 


Ala 


Ser 


Gin 


Val 


Cys 


Gly 


Pro 


Val 




145 










150 










155 










160 




TAT 


TGC 


TTC 


ACC 


CCG 


AGC 


CCT 


GTT 


GTG 


GTG 


GGG 


ACG 


AC*- 


GAT 


CGG 


TTT 


coo 

528 


-Tyr 


Cys 


nu ^ 


inr 


Pro 


oer 


Pro 


vax 


vax 


val 


uiy 


inr 


i nr 


Aso 


Arg 


rne 










165 










170 










175 






Iggt 


GTC 


CCC 


ACG 


TAT 


AAC 


TGG 


GGG 


GCG 


AAC 


GAC 


TCG 


GAT 


GTG 


CTG 


ATT 


576 


jGly 


Val 


Pro 


Tnr 


Tyr 


Asn 


Trp 


Gly 


A 1 -, 

Ala 


Asn 


Asp 


Ser 


A — — 

ASp 


Val 


Leu 


lie 










180 










185 










190 








:CTC 


AAC 


AAC 


ACG 


CGG 


CCG 


CCG 


CGA 


GGC 


AAC 


TGG 


TTC 


GGC 


TGT 


ACA 


TGG 


624 


; Leu 


Asn 


Asn 


Thr 


Arg 


Pro 


Pro 


Arg 


Gly 


Asn 


Trp 


nu A 

Pne 


Gly 


Cys 


Thr 


Trp 








195 










200 










205 










; ATG 


AAT 


GGC 


ACT 


GGG 


TTC 


ACC 


AAG 


ACG 


TGT 


GGG 


GGC 


CCC 


CCG 


TGC 


AAC 


672 


;Met 


Asn 


Gly 


Thr 


Gly 


Phe 


Thr 


Lys 


Thr 


Cys 


Gly 


Gly 


Pro 


Pro 


Cys 


Asn 






210 










215 










220 












ATC 


GGG 


GGG 


GCC 


GGC 


AAC 


AAC 


ACC 


TTG 


ACC 


TGC 


CCC 


ACT 


GAC 


TGT 


TTT 


720 


; lie 


Gly 


Gly 


Ala 


Gly 


Asn 


Asn 


Thr 


Leu 


Thr 


Cys 


Pro 


Thr 


Asp 


Cys 


Phe 




225 










230 










235 










240 




CGG 


AAG 


CAC 


CCC 


GAG 


GCC 


ACC 


TAC 


GCC 


AGA 


TGC 


GGT 


TCT 


GGG 


CCC 


TGG 


768 


Arg 


Lys 


His 


Pro 


Glu 


Ala 


Thr 


Tyr 


Ala 


Arg 


Cys 


Gly 


Ser 


Gly 


Pro 


Trp 












245 










250 










255 






CTG 


A A 

ACA 


CCT 


AGG 


TGT 


ATG 


GTT 


CAT 


TAC 


CCA 


TAT 


AGG 


CTC 


T^va 


CAC 


TAC 


o lo 


Leu 


Thr 


Pro 


Arg 


Cys 


Met 


Val 


His 


Tyr 


Pro 


Tyr 


Arg 


Leu 


Trp 


His 


Tyr 










260 










265 










270 








CCC 


TGC 


ACT 


GTC 


AAC 


TTC 


ACC 


ATC 


TTC 


AAG 


GTT 


AGG 


ATG 


TAC 


GTG 


GGG 


864 


Fro 


Cys 


Tnr 


val 


Asn 


rne 


Tnr 


lie 


Pne 


Lys 


Val 


Arg 


Met 


Tyr 


Val 


Gly 








275 










280 










285 










GGC 


GTG 


GAG 


CAC 


AGG 


TTC 


GAA 


GCC 


GCA 


TGC 


AAT 


TGG 


ACT 


CGA 


GGA 


GAG 


912 


Gly 


Val 


Glu 


His 


Arg 


Phe 


Glu 


Ala 


Ala 


Cys 


Asn 


Trp 


Thr 


Arg 


Gly 


Glu 






290 










295 










300 












CGT 


TGT 


GAC 


TTG 


GAG 


GAC 


AGG 


GAT 


AGA 


TCA 


GAG 


CTT 


AGC 


CCG 


CTG 


CTG 


960 


Arg 


Cys 


Asp 


Leu 


Glu 


Asp 


Arg 


Asp 


Arg 


Ser 


Glu 


Leu 


Ser 


Pro 


Leu 


Leu 




305 










310 










315 










320 




CTG 


TCT 


ACA 


ACA 


GAG 


TGG 


CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


1008 



Leu Ser Thr Thr Glu Trp Gin lie Leu Pro Cys Ser Phe Thr Thr Leu 

325 330 335 

CCG GCC CTA TCC ACC GGC CTG ATC CAC CTC CAT CAG AAC ATC GTG GAC 

Pro Ala Leu Ser Thr Gly Leu lie His Leu His Gin Asn lie Val Asp 

340 345 350 

GTG CAA TAC CTG TAC GGT GTA GGG TCG GCG GTT GTC TCC CTT GTC ATC 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 

355 360 365 

AAA TGG GAG TAT GTC CTG TTG CTC TTC CTT CTC CTG GCA GAC GCG CGC 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 

370 375 380 

ATC TGC GCC TGC TTA TGG ATG ATG CTG CTG ATA GCT CAA GCT GAG GCC 

He Cys Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 

385 390 395 400 

GCC TTA GAG AAC CTG GTG GTC CTC AAT GCG GCG GCC GTG GCC GGG GCG 
Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 



QZAT GGC ACT CTT TCC TTC CTT GTG TTC TTC TGT GCT GCC TGG TAC ATC 

OlHis Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr He 

|q 420 425 430 

liAAG GGC AGG CTG GTC CCT GGT GCG GCA TAC GCC TTC TAT GGC GTG TGG 

"~Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
^ 435 440 445 

MCCG CTG CTC CTG CTT CTG CTG GCC TTA CCA CCA CGA GCT TAT GCC TAGTAA 

=■ Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
H 450 455 460 



;;^(2) INFORMATION FOR SEQ ID NO: 46: 

*Q (i) SEQUENCE CHARACTERISTICS: 

%d (A) LENGTH: 4 63 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 6: 

Met Val Ala Gly Ala His Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr 
15 10 15 

Ser Met Val Gly Asn Trp Ala Lys Val Leu Val Val Met Leu Leu Phe 
20 25 30 

' Ala Gly Val Asp Gly His Thr Arg Val Ser Gly Gly Ala Ala Ala Ser 
35 40 45 

Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys 



1056 
1104 
1152 
1200 
1248 
1296 
1344 
1395 



55 



60 



He Gin Leu Val Asn Thr Asn Gly Ser Trp His He Asn Arg Thr Ala 
65 70 75 80 



Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe 
85 90 95 

Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser 
100 105 110 

Cvs Arg Ser lie Asp Lys Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr 
115 120 125 

Thr Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala 
130 135 140 

Pro Arg Pro Cys Gly lie Val Pro Ala Ser Gin Val Cys Gly Pro Val 
145 150 155 160 

Tvr Cys Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Phe 
Y 165 170 175 

Gly Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp Ser Asp Val Leu He 
180 185 190 

Leu Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp 
195 200 205 

" let Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn 
■r 210 215 220 

■' le Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe 
i !25 ' 230 235 240 

' krg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp 
245 250 255 

,eu Thr Pro Arg Cys Met Val His Tyr Pro Tyr Arg Leu Trp His Tyr 
3 260 265 270 

.•' >ro Cys Thr Val Asn Phe Thr He Phe Lys Val Arg Met Tyr Val Gly 
275 280 285 

^ ly Val Glu His Arg Phe Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu 

-•' 290 295 300 

Lrg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu 
105 310 315 320 

Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys Ser Phe Thr Thr Leu 
325 330 335 

Pro Ala Leu Ser Thr Gly Leu lie His Leu His Gin Asn He Val Asp 
340 345 350 

Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val Val Ser Leu Val He 
355 360 365 

Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg 
370 375 380 

lie Cvs Ala Cys Leu Trp Met Met Leu Leu He Ala Gin Ala Glu Ala 
385 390 395 400 

Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala Ala Val Ala Gly Ala 



405 



410 



415 



His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys Ala Ala Trp Tyr lie 
420 425 430 

Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala Phe Tyr Gly Val Trp 
435 440 445 

Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro Arg Ala Tyr Ala 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 
- (iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME /KEY: CDS 

(3) LOCATION: 1..2079 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..2076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

AT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC GGC TTC GCC GAC CTC 

: ; t sn Leu Gly Lys Val lie Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 

1 5 10 15 

GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA GGG GGC GCT GCC AGG 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 

20 25 30 

GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC GGC GTG AAC TAT GCA 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 

35 40 45 

ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC TTC CTC TTG GCT TTG 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 

50 55 60 

CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT GAA GTG CGC AAC GTG 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 

65 70 75 80 



TCC GGG ATG TAC CAT GTC ACG AAC GAC TGC TCC AAC TCA AGC ATT GTG 
Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser lie Val 
85 90 95 




TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC GGG TGC GTG CCC TGC 
Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 



336 



GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA GCG CTC ACC CCC ACG 
Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
115 120 125 



384 



CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG ACA ATA CGA CGC CAC 
Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr Thr He Arg Arg His 
130 135 140 



432 



^GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT TCC GCT ATG TAC GTG 
Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
145 150 155 160 



480 



GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC CAG CTG TTC ACC ATC 
Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser Gin Leu Phe Thr He 
165 170 175 



528 



TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC AAT TGC TCA ATC TAT 57 6 

Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys Asn Cys Ser He Tyr 
1 180 185 190 

CAC ATA ACG GGT CAC CGT ATG GCT TGG GAT ATG ATG ATG AAC 624 
His He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn 
195 200 205 



m :cc 

,n >ro 



GGC 
Gly 



w ;gg 
H :rp 



TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG CTG CTC CGG ATC CCA 
Ser Pro Thr Thr Ala Leu Val Val Ser Gin Leu Leu Arg He Pro 
210 215 220 



672 



:aa 
;in 

225 

;gc 
?iy 



GCT 
Ala 



CTC 
Leu 



GTC 
Val 



GCC 
Ala 



GTG GAC ATG GTG GCG GGG GCC CAT TGG GGA GTC CTG GCG 720 
Val Asp Met Val Ala Gly Ala His Trp Gly Val Leu Ala 
230 235 240 

TAC TAT TCC ATG GTG GGG AAC TGG GCT AAG GTT TTG GTT 768 
Tyr Tyr Ser Met Val Gly Asn Trp Ala Lys Val Leu Val 
245 . 250 255 



GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT ACC CGC GTG TCA GGA 
Val Met Leu Leu Phe Ala Gly Val Asp Gly His Thr Arg Val Ser Gly 
260 265 270 



816 



GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG TCC CTC TTT AGC CCC 
Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
275 280 285 



864 



GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC AAC GGC AGT TGG CAC 
Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr Asn Gly Ser Trp His 
290 295 300 



912 



ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC CTC- CAA ACA GGG TTC 
He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
305 310 315 320 



960 



TTT GCC GCA CTA TTC TAC AAA CAC AAA TTC AAC TCG TCT GGA TGC CCA 
Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
325 330 335 



1008 



GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG TTC GCT CAG GGG TGG 



1056 



Glu 


Arg 


Leu 


Ala 
340 


Ser 


Cys 


Arg 


Ser 


He 
345 


GGT 
Gly 


CCC 
Pro 


CTC 
Leu 
355 


ACT 
Thr 


TAC 
Tyr 


ACT 
Thr 


GAG 
Glu 


CCT 
Pro 
360 


AAC 
Asn 


TGC 
Cys 


TGG 
Trp 
370 


CAC 
His 


TAC 
Tyr 


GCG 
Ala 


CCT 
Pro 


CGA 
Arg 
375 


CCG 
Pro 


TGT 
Cys 


GTG 
Val 
385 


TGC 
Cys 


GGT 
Gly 


CCA 
Pro 


GTG 
Val 


TAT 
Tyr 
390 


TGC 
Cys 


TTC 
Phe 


ACC 
Thr 


ACG 
Thr 

TCG 
Ser 


ACC 
Thr 

GAT 
Asp 


GAT 
Asp 

GTG 
Val 


CGG 
Arg 

CTG 
Leu 
420 


TTT 
Phe 
405 
ATT 
He 


GGT 
Gly 

CTC 
Leu 


GTC 
Val 

AAC 
Asn 


CCC 
Pro 

AAC 
Asn 


ACG 
Thr 

ACG 
Thr 
425 


TTC 
Phe 


GGC 
Gly 


TGT 
Cys 
435 


ACA 
Thr 


TGG 
Trp 


ATG 
Met 


AAT 
Asn 


GGC 
Gly 
440 


ACT 
Thr 


GGC 
Gly 


CCC 
Pro 
450 


CCG 
Pro 


TGC 
Cys 


AAC 
Asn 


ATC 
He 


GGG 
Gly 
455 


GGG 
Gly 


GCC 
Ala 


CCC 
Pro 
465 


ACT 
Thr 


GAC 
Asp 


TGT 
Cys 


TTT 
Phe 


CGG 
Arg 
470 


AAG 
Lys 


CAC 
His 


CCC 
Pro 


GGT 
Gly 


TCT 
Ser 


GGG 
Gly 


CCC 
Pro 


TGG 
Trp 
485 


CTG 
Leu 


ACA 
Thr 


CCT 
Pro 


AGG 
Arg 


AGG 
Arg 


CTC 
Leu 


TGG 
Trp 


CAC 
His 
500 


TAC 
Tyr 


CCC 
Pro 


TGC 
Cys 


ACT 
Thr 


GTC 
Val 
505 


AGG 
Arg 


ATG 
Met 


TAG 
Tyr 
515 


GTG 
Val 


GGG 
Gly 


GGC 
Gly 


GTG 
Val 


GAG 
Glu 
520 


CAC 
His 


TGG 
Trp 


ACT 
Thr 
530 


CGA 
Arg 


GGA 
Gly 


GAG 
Glu 


CGT 
Arg 


TGT 
Cys 
535 


GAC 
Asp 


TTG 
Leu 


CTT 
Leu 
545 


AGC 
Ser 


CCG 
Pro 


CTG 
Leu 


CTG 
Leu 


CTG 
Leu 
550 


TCT 
Ser 


ACA 
Thr 


ACA 
Thr 


TCC 
Ser 


TTC 
Phe 


ACC 
Thr 


ACC 
Thr 


CTG 
Leu 
565 


CCG 
Pro 


GCC 
Ala 


CTA 
Leu 


TCC 
Ser 


CAG 
Gin 


AAC 
Asn 


ATC 
He 


GTG 
Val 
580 


GAC 
Asp 


GTG 
Val 


CAA 
Gin 


TAC 
Tyr 


CTG 
Leu 
585 



Asp 


Lys 


Phe 


Ala 


Gin 
350 


Gly 


Trp 




AGC 
Ser 


TCG 
Ser 


GAC 
Asp 


CAG 
Gin 
365 


AGG 
Arg 


CCC 
Pro 


TAC 
Tyr 


1104 


GGT 
Gly 


ATT 
lie 


GTA 
Val 
380 


CCC 
Pro 


GCG 
Ala 


TCT 
Ser 


CAG 
Gin 


1152 


CCG 
Pro 


AGC 
Ser 
395 


CCT 
Pro 


GTT 
Val 


GTG 
Val 


GTG 
Val 


GGG 
Gly 
400 


1200 


TAT 
Tyr 
410 
CGG 
Arg 


AAC 
Asn 

CCG 
Pro 


TGG 
Trp 

CCG 
Pro 


GGG 
Gly 

CGA 
Arg 


GCG 
Ala 

GGC 
Gly 
430 


AAC 
Asn 
415 
AAC 
Asn 


GAC 
Asp 

TGG 
Trp 


1248 
1296 


GGG 
Gly 


TTC 
Phe 


ACC 
Thr 


AAG 
Lys 
445 


ACG 
Thr 


TGT 
Cys 


GGG 
Gly 


1344 


GGC 
Gly 


AAC 
Asn 


AAC 
Asn 
4 60 


ACC 
Thr 


TTG 
Leu 


ACC 
Thr 


TGC 
Cys 


1392 


GAG 
Glu 


GCC 
Ala 
475 


ACC 
Thr 


TAC 
Tyr 


GCC 
Ala 


AGA 
Arg 


TGC 
Cys 
480 


1440 


TGT 
Cys 
490 


ATG 
Met 


GTT 
Val 


CAT 
His 


TAC 
Tyr 


CCA 
Pro. 
495 


TAT 
Tyr 


1488 


AAC 
Asn 


TTC 
Phe 


ACC 
Thr 


ATC 
He 


TTC 
Phe 
510 


AAG 
Lys 


GTT 
Val 


1536 


AGG 
Arg 


TTC 
Phe 


GAA 
Glu 


GCC 
Ala 
525 


GCA 
Ala 


TGC 
Cys 


AAT 
Asn 


1584 


GAG 
Glu 


GAC 
Asp 


AGG 
Arg 
540 


GAT 
Asp 


AGA 
Arg 


TCA 
Ser 


GAG 
Glu 


1632 


GAG 
Glu 


TGG 
Trp 
555 


CAG 
Gin 


ATA 
He 


CTG 
Leu 


CCC 
Pro 


TGT 
Cys 

C ^ A 

560 


1680 


ACC 
Thr 
570 


GGC 
Gly 


CTG 
Leu 


ATC 
He 


CAC 
His 


CTC 
Leu 
575 


CAT 
His 


1728 


TAC 
Tyr 


GGT 
Gly 


GTA 
Val 


GGG 
Gly 


TCG 
Ser 


GCG 
Ala 


GTT 
Val 


1776 



GTC 


TCC 


CTT 


GTC 


ATC 


AAA 


TGG GAG 


TAT 


GTC 


CTG 


TTG 


CTC 


TTC 


CTT 


CTC 


1824 


Val 


Ser 


Leu 


Val 


He 


Lys 


Trp Glu 


Tyr Val 


Leu 


Leu 


Leu 


Phe 


Leu 


Leu 








595 








600 










605 










CTG 


GCA 


GAC 


GCG 


CGC 


ATC 


TGC GCC 


TGC 


TTA 


TGG 


ATG 


ATG 


CTG 


CTG 


ATA 


1872 


Leu 


Ala 


Aso 


Ala 


Ara 


He 


Cys Ala 


Cys 


Leu Trp Met 


Met 


Leu 


Leu 


He 






610 










615 








620 












GCT 


CAA 


GCT 


GAG 


GCC 


GCC 


TTA GAG 


AAC 


CTG 


GTG 


GTC 


CTC 


AAT 


GCG 


GCG 


1920 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu Glu 


Asn 


Leu 


Val 


Val 


Leu 


Asn 


Ala 


Ala 




625 










630 








635 










640 




GCC 


GTG 


GCC 


GGG 


GCG 


CAT 


GGC ACT 


CTT 




TTC 


CTT 


GTG 


TTC 


TTC 


TGT 


1968 


Ala 


Val 


Ala 


Gly 


Ala 


His 


Gly Thr 


Leu 


Ser 


Phe 


Leu 


Val 


Phe 


Phe 


Cys 












645 








650 










655 






GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC AGG 


Liu 


GTC 


CCT 


GGT 


GCG 


GCA 


TAC 


GCC 


2016 


Ala 


Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg 


Leu 


Val 


Pro Gly 


Ala 


Ala 


Tyr 


Ala 










bbu 








665 










b / U 








TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC 


CTG 


CTT 


CTG 


CTG 


GCC 


TTA 


CCA 


CCA 


2064 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro 


Leu Leu 


Leu 


Leu 


Leu 


Leu 


Ala 


Leu 


Pro 


Pro 






675 








680 










685 










OCGA 


GCT 


TAT 


GCC 


TAGTAA 




















2082 


L&rq 


Ala 


Tyr 


Ala 




























690 































5 J(2) INFORMATION FOR SEQ ID NO: 48: 

^ (i) SEQUENCE CHARACTERISTICS: 

s (A) LENGTH: 692 amino acids 

Q (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

^ (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu 
T l 5 10 15 

Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg 
20 25 30 

Ala Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
35 40 45 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala Leu 
50 55 60 

Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Glu Val Arg Asn Val 
65 70 75 80 

Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser Asn Ser Ser He Val 
85 90 95 



Tyr Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys Val Pro Cys 
100 105 110 



Val Arg Glu Asn Asn Ser 
115 



Ser Arg Cys Trp Val Ala Leu Thr Pro Thr 
120 125 



Leu Ala Ala Arg Asn Ala 
130 



Ser Val Pro Thr Thr Thr He Arg Arg His 
135 140 



Val Asp Leu Leu Val Gly 
145 150 



Ala Ala Ala Phe Cys Ser Ala Met Tyr Val 
155 160 



Gly Asp Leu Cys Gly Ser 
165 



Val Phe Leu Val Ser Gin Leu Phe Thr He 
170 175 



Ser Pro Arg Arg His Glu 
180 



Thr Val Gin Asd Cys Asn Cys Ser He Tyr 
185 190 



Pro Gly His He Thr Gly 
195 



His Arg Met Ala Trp Asp Met Met Met Asn 
200 205 



Trp Ser Pro Thr Thr Ala 
210 



Leu Val Val Ser Gin Leu Leu Arg He Pro 
215 220 



Gin Ala Val Val Asp Met 
225 230 



Val Ala Gly Ala His Trp Gly Val Leu Ala 
235 240 



1 Gly Leu Ala Tyr Tyr Ser 

* 245 



Met Val Gly Asn Tru Ala Lys Val Leu Val 
250 255 



Val Met Leu Leu Phe Ala 
260 



Gly Val Asp Gly His Thr Arg Val Ser Gly 
265 270 



Gly Ala Ala Ala Ser Asp 
275 



Thr Arg Gly Leu Val Ser Leu Phe Ser Pro 
280 285 



-s Gly Ser Ala Gin Lys He 
4 290 



Gin Leu Val Asn Thr Asn Gly Ser Trp His 
295 300 



~* He Asn Arg Thr Ala Leu 
J 305 310 



Asn Cys Asn Asp Ser Leu Gin Thr Gly Phe 
315 320 



| Phe Ala Ala Leu Phe Tyr 
325 



Lys His Lys Phe Asn Ser Ser Gly Cys Pro 
330 335 



Glu Arg Leu Ala Ser Cys 
340 



Arg Ser He Asp Lys Phe Ala Gin Gly Trp 
345 350 



Gly Pro Leu Thr Tyr Thr 
355 



Glu Pro Asn Ser Ser Asp Gin Arg Pro Tyr 
360 365 



Cys Trp His Tyr Ala Pro 
370 



Arg Pro Cys Gly He Val Pro Ala Ser Gin 
375 380 



Val Cys Gly Pro Val Tyr 
385 390 



Cys Phe Thr Pro Ser Pro Val Val Val Gly 
395 400 



Thr Thr Asp Arg Phe Gly 
405 



Val Pro Thr Tyr Asn Trp Gly Ala Asn Asp 
410 415 



Ser Asp Val Leu He Leu 
420 



Asn Asn Thr Arg Pro Pro Arg Gly Asn Trp 
425 430 



Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe Thr Lys Thr Cys Gly 



435 440 445 

Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn Asn Thr Leu Thr Cys 
450 455 460 

Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala Thr Tyr Ala Arg Cys 
465 470 475 480 

Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met Val His Tyr Pro Tyr 
485 490 495 

Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe Thr lie Phe Lys Val 
500 505 510 

Arg Met Tyr Val Gly Gly Val Glu His Arg Phe Glu Ala Ala Cys Asn 
515 520 525 

Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp Arg Ser Glu 
530 535 540 

Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp Gin He Leu Pro Cys 
545 550 555 560 

r^Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly Leu He His Leu His 
g 565 570 575 

Sfeln Asn He Val Asp Val Gin Tyr Leu Tyr Gly Val Gly Ser Ala Val 
580 585 590 

%Val Ser Leu Val He Lys Trp Glu Tyr Val Leu Leu Leu Phe Leu Leu 
IJl 595 600 605 

Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp Met Met Leu Leu He 
U 610 615 620 

^Ala Gin Ala Glu Ala Ala Leu Glu Asn Leu Val Val Leu Asn Ala Ala 
" *625 630 635 640 



iSAla Val Ala Gly Ala His Gly Thr Leu Ser Phe Leu Val Phe Phe Cys 
T\ 645 650 655 



Ala Ala Trp Tyr He Lys Gly Arg Leu Val Pro Gly Ala Ala Tyr Ala 
660 665 670 

Phe Tyr Gly Val Trp Pro Leu Leu Leu Leu Leu Leu Ala Leu Pro Pro 
675 680 685 

Arg Ala Tyr Ala 
690 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



% • 

(iii) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1..2430 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 1..2427 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

ATG AGC ACG AAT CCT AAA CCT CAA AGA AAA ACC AAA CGT AAC ACC AAC 48 
Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

CGC CGC CCA CAG GAC GTC AAG TTC CCG GGC GGT GGT CAG ATC GTT GGT 96 
Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

OCGA GTT TAC CTG TTG CCG CGC AGG GGC CCC AGG TTG GGT GTG CGC GCG 144 
SB Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
yQ 35 40 45 

KACT AGG AAG ACT TCC GAG CGG TCG CAA CCT CGT GGG AGG CGA CAA CCT 192 
^Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
?i 50 55 60 

hi' 1 

HPJTC CCC AAG GCT CGC CGA CCC GAG GGT AGG GCC TGG GCT CAG CCC GGG 240 

si He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
n 65 70 75 80 

J TAC CCT TGG CCC CTC TAT GGC AAT GAG GGC ATG GGG TGG GCA GGA TGG 288 
C'-Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
& 85 90 95 

if! 

>|CTC CTG TCA CCC CGC GGC TCT CGG CCT AGT TGG GGC CCT ACA GAC CCC 336 
" 'Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

CGG CGT AGG TCG CGT AAT TTG GGT AAG GTC ATC GAT ACC CTT ACA TGC 384 
Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

: GGC TTC GCC GAC CTC GTG GGG TAC ATT CCG CTC GTC GGC GCC CCC CTA 432 
Gly Phe Ala Asp Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

GGG GGC GCT GCC AGG GCC CTG GCG CAT GGC GTC CGG GTT CTG GAG GAC 480 
Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

GGC GTG AAC TAT GCA ACA GGG AAT TTG CCC GGT TGC TCT TTC TCT ATC 528 
Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

TTC CTC TTG GCT TTG CTG TCC TGT CTG ACC GTT CCA GCT TCC GCT TAT 57 6 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 



180 185 190 

GAA GTG CGC AAC GTG TCC GGG ATG TAG CAT GTC ACG AAC GAC TGC TCC 624 
Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

AAC TCA AGC ATT GTG TAT GAG GCA GCG GAC ATG ATC ATG CAC ACC CCC 672 
Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
210 215 220 

GGG TGC GTG CCC TGC GTT CGG GAG AAC AAC TCT TCC CGC TGC TGG GTA 720 
Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

GCG CTC ACC CCC ACG CTC GCA GCT AGG AAC GCC AGC GTC CCC ACC ACG 768 
Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 

ACA ATA CGA CGC CAC GTC GAT TTG CTC GTT GGG GCG GCT GCT TTC TGT 816 
Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

„TCC GCT ATG TAC GTG GGG GAC CTC TGC GGA TCT GTC TTC CTC GTC TCC 864 

ClSer Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 

M 275 280 285 

fHCAG CTG TTC ACC ATC TCG CCT CGC CGG CAT GAG ACG GTG CAG GAC TGC 912 

.^Gln Leu Phe Thr lie Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 

;1 290 295 300 

wjAAT TGC TCA ATC TAT CCC GGC CAC ATA ACG GGT CAC CGT ATG GCT TGG 960 

^JAsn Cys Ser He Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 

* 305 310 315 320 

J5GAT ATG ATG ATG AAC TGG TCG CCT ACA ACG GCC CTG GTG GTA TCG CAG 1008 
rfAsp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
r! 325 330 335 

s'tj 

iflCTG CTC CGG ATC CCA CAA GCT GTC GTG GAC ATG GTG GCG GGG GCC CAT 105 6 

^|Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
■ ; 340 345 350 

TGG GGA GTC CTG GCG GGC CTC GCC TAC TAT TCC ATG GTG GGG AAC TGG 1104 
Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

GCT AAG GTT TTG GTT GTG ATG CTA CTC TTT GCC GGC GTC GAC GGG CAT 1152 
Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

ACC CGC GTG TCA GGA GGG GCA GCA GCC TCC GAT ACC AGG GGC CTT GTG 1200 
Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 
385 390 395 400 

TCC CTC TTT AGC CCC GGG TCG GCT CAG AAA ATC CAG CTC GTA AAC ACC 1248 
Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys He Gin Leu Val Asn Thr 
405 410 415 

AAC GGC AGT TGG CAC ATC AAC AGG ACT GCC CTG AAC TGC AAC GAC TCC 1296 
Asn Gly Ser Trp His He Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 




CTC CAA ACA GGG TTC TTT GCC GCA CTA TTC TAG AAA CAC AAA TTC AAC 134 4 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 

435 440 445 

TCG TCT GGA TGC CCA GAG CGC TTG GCC AGC TGT CGC TCC ATC GAC AAG 1392 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys 

450 455 460 

TTC GCT CAG GGG TGG GGT CCC CTC ACT TAC ACT GAG CCT AAC AGC TCG 14 40 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 

465 470 475 480 

GAC CAG AGG CCC TAC TGC TGG CAC TAC GCG CCT CGA CCG TGT GGT ATT 1488 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly He 

485 490 495 

GTA CCC GCG TCT CAG GTG TGC GGT CCA GTG TAT TGC TTC ACC CCG AGC 1536 

Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 

500 505 510 

CCT GTT GTG GTG GGG ACG ACC GAT CGG TTT GGT GTC CCC ACG TAT AAC 1584 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 

Q 515 520 525 

2 TGG GGG GCG AAC GAC TCG GAT GTG CTG ATT CTC AAC AAC ACG CGG CCG 1632 

5l Trp Gly Ala Asn Asp Ser Asp Val Leu He Leu Asn Asn Thr Arg Pro 

IS 530 535 540 

M CCG CGA GGC AAC TGG TTC GGC TGT ACA TGG ATG AAT GGC ACT GGG TTC 1680 

U] Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 

M 545 550 555 560 

f- s ACC AAG ACG TGT GGG GGC CCC CCG TGC AAC ATC GGG GGG GCC GGC AAC 1728 

'2 Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn lie Gly Gly Ala Gly Asn 

ft 565 570 . 575 

fU AAC ACC TTG ACC TGC CCC ACT GAC TGT TTT CGG AAG CAC CCC GAG GCC 177 6 

%& Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 

Q 580 585 590 

ACC TAC GCC AGA TGC GGT TCT GGG CCC TGG CTG ACA CCT AGG TGT ATG 1824 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 

595 600 605 

GTT CAT TAC CCA TAT AGG CTC TGG CAC TAC CCC TGC ACT GTC AAC TTC 1872 

Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 

610 615 620 

ACC ATC TTC AAG GTT AGG ATG TAC GTG GGG GGC GTG GAG CAC AGG TTC 1920 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 

625 630 635 640 

GAA GCC GCA TGC AAT TGG ACT CGA GGA GAG CGT TGT GAC TTG GAG GAC 1968 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 

645 650 655 

AGG GAT AGA TCA GAG CTT AGC CCG CTG CTG CTG TCT ACA ACA GAG TGG 2016 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 

660 665 670 



CAG 


ATA 


CTG 


CCC 


TGT 


TCC 


TTC 


ACC 


ACC 


CTG 


CCG 


GCC CTA TCC 


ACC 


GGC 


2064 


Gin 


lie 


Leu 


Pro 


Cys 


Ser 


Phe 


Thr 


Thr 


Leu 


Pro Ala Leu Ser Thr Gly 








675 










680 








685 








CTG 


ATC 


CAC 


CTC 


CAT 


CAG 


AAC 


ATC 


GTG 


GAC 


GTG 


CAA TAC CTG 


TAC 


GGT 


2112 


Leu 


lie 


His 


Leu 


His 


Gin 


Asn 


lie 


Val 


Asp 


Val Gin Tyr Leu Tyr Gly 






690 










695 










700 








GTA 


GGG 


TCG 


GCG 


GTT 


GTC 


TCC 


CTT 


GTC 


ATC 


AAA 


TGG GAG TAT 


GTC 


CTG 


2160 


Val 


Gly 


Ser 


Ala 


Val 


Val 


Ser 


Leu 


Val 


He 


Lys Trp Glu Tyr Val Leu 




705 










710 










715 






720 




TTG 


CTC 


TTC 


CTT 


CTC 


CTG 


GCA 


GAC 


GCG 


CGC 


ATC 


TGC GCC TGC 


TTA 


TGG 


2208 


Leu 


Leu 


Phe 


Leu 


Leu 


Leu 


Ala 


Asp 


Ala 


Arg 


He 


Cys Ala Cys 


Leu Trp 












725 










730 






735 






ATG 


ATG 


CTG 


CTG 


ATA 


GCT 


CAA 


GCT 


GAG 


GCC 


GCC 


TTA GAG AAC 


CTG 


GTG 


2256 


Met 


Met 


Leu 


Leu 


He 


Ala 


Gin 


Ala 


Glu 


Ala 


Ala 


Leu Glu Asn 


Leu 


Val 










740 










745 






750 








GTC 


CTC 


AAT 


GCG 


GCG 


GCC 


GTG 


GCC 


GGG 


GCG 


CAT 


GGC ACT CTT 


TCC 


TTC 


2304 


Val 


Leu 


Asn 


Ala 


Ala 


Ala 


Val 


Ala 


Gly 


Ala 


His 


Gly Thr Leu 


Ser 


Phe 








755 










760 








765 








Q 

IB CTT 


GTG 


TTC 


TTC 


TGT 


GCT 


GCC 


TGG 


TAC 


ATC 


AAG 


GGC AGG CTG 


GTC 


CCT 


2352 


/r? Leu 


Val 


Phe 


Phe 


Cys 


Ala 


Ala 


Trp 


Tyr 


He 


Lys 


Gly Arg Leu Val 


Pro 






770 










775 










780 








^GGT 


GCG 


GCA 


TAC 


GCC 


TTC 


TAT 


GGC 


GTG 


TGG 


CCG 


CTG CTC CTG 


CTT 


CTG 


2400 


"^Gly 


Ala 


Ala 


Tyr 


Ala 


Phe 


Tyr 


Gly 


Val 


Trp 


Pro 


Leu Leu Leu 


Leu 


Leu 




111785 










790 










795 






800 




SJCTG 


GCC 


TTA 


CCA 


CCA 


CGA 


GCT 


TAT 


GCC 


TAGTAA 








2433 


s Leu 


Ala 


Leu 


Pro 


Pro 


Arg 


Ala 


Tyr 


Ala 






















805 










810 













(2) INFORMATION FOR SEQ ID NO: 50: 

y3 (i) SEQUENCE CHARACTERISTICS: 

M _ (A) LENGTH: 809 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr Asn 
15 10 15 

Arg Arg Pro Gin Asp Val Lys Phe Pro Gly Gly Gly Gin He Val Gly 
20 25 30 

Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala 
35 40 45 

Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly Arg Arg Gin Pro 
50 55 60 



He Pro Lys Ala Arg Arg Pro Glu Gly Arg Ala Trp Ala Gin Pro Gly 
65 70 75 80 




Tyr Pro Trp Pro Leu Tyr Gly Asn Glu Gly Met Gly Trp Ala Gly Trp 
85 90 95 

Leu Leu Ser Pro Arg Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro 
100 105 110 

Arg Arg Arg Ser Arg Asn Leu Gly Lys Val He Asp Thr Leu Thr Cys 
115 120 125 

Gly Phe Ala Asp Leu Val Gly Tyr He Pro Leu Val Gly Ala Pro Leu 
130 135 140 

Gly Gly Ala Ala Arg Ala Leu Ala His Gly Val Arg Val Leu Glu Asp 
145 150 155 160 

Gly Val Asn Tyr Ala Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He 
165 170 175 

Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr 
180 185 190 

Glu Val Arg Asn Val Ser Gly Met Tyr His Val Thr Asn Asp Cys Ser 
195 200 205 

Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr Pro 
210 215 220 

Gly Cys Val Pro Cys Val Arg Glu Asn Asn Ser Ser Arg Cys Trp Val 
225 230 235 240 

Ala Leu Thr Pro Thr Leu Ala Ala Arg Asn Ala Ser Val Pro Thr Thr 
245 250 255 

Thr He Arg Arg His Val Asp Leu Leu Val Gly Ala Ala Ala Phe Cys 
260 265 270 

Ser Ala Met Tyr Val Gly Asp Leu Cys Gly Ser Val Phe Leu Val Ser 
275 280 285 

Gin Leu Phe Thr He Ser Pro Arg Arg His Glu Thr Val Gin Asp Cys 
• 290 295 300 

Asn Cys Ser lie Tyr Pro Gly His He Thr Gly His Arg Met Ala Trp 
305 310 315 320 

Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu Val Val Ser Gin 
325 330 335 

Leu Leu Arg He Pro Gin Ala Val Val Asp Met Val Ala Gly Ala His 
340 345 350 

Trp Gly Val Leu Ala Gly Leu Ala Tyr Tyr Ser Met Val Gly Asn Trp 
355 360 365 

Ala Lys Val Leu Val Val Met Leu Leu Phe Ala Gly Val Asp Gly His 
370 375 380 

Thr Arg Val Ser Gly Gly Ala Ala Ala Ser Asp Thr Arg Gly Leu Val 



385 



390 



400 



• 



Ser Leu Phe Ser Pro Gly Ser Ala Gin Lys lie Gin Leu Val Asn Thr 
405 410 415 

Asn Gly Ser Trp His lie Asn Arg Thr Ala Leu Asn Cys Asn Asp Ser 
420 425 430 

Leu Gin Thr Gly Phe Phe Ala Ala Leu Phe Tyr Lys His Lys Phe Asn 
435 440 445 

Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Ser lie Asp Lys 
450 455 460 

Phe Ala Gin Gly Trp Gly Pro Leu Thr Tyr Thr Glu Pro Asn Ser Ser 
465 470 475 480 

Asp Gin Arg Pro Tyr Cys Trp His Tyr Ala Pro Arg Pro Cys Gly lie 
485 490 495 

Val Pro Ala Ser Gin Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser 
500 505 510 

Pro Val Val Val Gly Thr Thr Asp Arg Phe Gly Val Pro Thr Tyr Asn 
515 520 525 

fir; Trp Gly Ala Asn Asp Ser Asp Val Leu lie Leu Asn Asn Thr Arg Pro 
*S 530 535 540 

K 3 

It Pro Arg Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Gly Thr Gly Phe 
545 550 555 560 

111 Thr Lys Thr Cys Gly Gly Pro Pro Cys Asn He Gly Gly Ala Gly Asn 
S| 565 570 575 

Asn Thr Leu Thr Cys Pro Thr Asp Cys Phe Arg Lys His Pro Glu Ala 
% 580 585 590 

Thr Tyr Ala Arg Cys Gly Ser Gly Pro Trp Leu Thr Pro Arg Cys Met 
fU 595 600 605 

%.* Val His Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr Val Asn Phe 
610 615 620 

Thr He Phe Lys Val Arg Met Tyr Val Gly Gly Val Glu His Arg Phe 
625 630 635 640 

Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys Asp Leu Glu Asp 
645 650 655 

Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Ser Thr Thr Glu Trp 
660 665 670 

Gin He Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala Leu Ser Thr Gly 
675 680 685 

Leu He His Leu His Gin Asn He Val Asp Val Gin Tyr Leu Tyr Gly 
690 695 700 

Val Gly Ser Ala Val Val Ser Leu Val He Lys Trp Glu Tyr Val Leu 
705 710 715 720 

Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg He Cys Ala Cys Leu Trp 



725 



Met 


Met 


Leu 


Leu 
740 


lie 


Ala 


Gin 


Ala 


Val 


Leu 


Asn 
755 


Ala 


Ala 


Ala 


Val 


Ala 
760 


Lea 


Val 
770 


Phe 


Phe 


Cys 


Ala 


Ala 
775 


Trp 


Gly Ala 
785 

Leu Ala 


Ala 
Leu 


Tyr 
Pro 


Ala 

Pro 
805 


Phe 
790 
Arg 


Tyr 
Ala 


Gly 
Tyr 



• 



730 735 

Glu Ala Ala Leu Glu Asn Leu Val 
745 750 

Gly Ala His Gly Thr Leu Ser Phe 
765 

Tyr lie Lys Gly Arg Leu Val Pro 
780 



Val Trp Pro Leu Leu Leu Leu Leu 
795 800 

Ala 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1. .17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Ser Asn Ser Ser Glu Ala Ala Asp Met He Met His Thr Pro Gly Cys 
15 10 15 

Val 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1. .22 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Gly Gly He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp 



% 



ft 



10 15 



Ser Pro Thr Thr Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..37 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Tyr GIu Val Arg Asn Val Ser Gly He Tyr His Val Thr Asn Asp Cys 
i 1 5 10 15 

Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 
[ 20 25 30 

l Pro Gly Cys Gly Lys 
35 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Gly Gly Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
15 10 15 

Gin Leu Arg Arg His He Asp Leu Leu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modif ied-site 

(B) LOCATION: 1..25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly Gly Thr Pro Thr Leu Ala Ala Arg Asp Ala Ser Val Pro Thr Thr 
15 10 15 

Thr lie Arg Arg His Val Asp Leu Leu 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Tyr Gin Val Arg Asn 
15 10 15 

Ser Thr Gly Leu • 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Gin Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr Asn Asp Cys Pro 
15 10 15 

Asn Ser Ser lie 



20 



20 



20 



(2) INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Asn Asp Cys Pro Asn Ser Ser He Val Tyr Glu Ala His Asp Ala He 
15 10 15 

Leu His Thr Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



K (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

» Ser Asn Ser Ser He Val Tyr Glu Ala Ala Asp Met He Met His Thr 

n 15 10 15 

Pro Gly Cys Val 

H= 20 

fiM2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 
■ * (A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
f (C) STRANDEDNESS: single 

S (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

His Asp Ala He Leu His Thr Pro Gly Val Pro Cys Val Arg Glu Gly 
15 10 15 

Asn Val Ser 



(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH : 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Cys Val Arg Glu Gly Asn Val Ser Arg Cys Trp Val Ala Met Thr Pro 
15 10 15 

Thr Val Ala Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Cl Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys Leu Pro Ala Thr 
1 5 10 15 

Gin Leu Arg Arg 

51(2) INFORMATION FOR SEQ ID NO: 63: 

y (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 

Leu Pro Ala Thr Gin Leu Arg Arg His lie Asp Leu Leu Val Gly Ser 
15 10 15 



Ala Thr Leu Cys 
20 

(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Leu Val Gly Ser Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu 
15 10 15 

Cys Gly Ser Val 
20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

=, (ii) MOLECULE TYPE: peptide 



Rj (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

G Gin Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr Thr Gin Gly Cys 
ml 5 10 15 

Asn Cys Ser lie 

^y(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
\% (A) LENGTH: 20 amino acids 

^ (B) TYPE: amino acid 

^ (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Thr Gin Gly Cys Asn Cys Ser He Tyr Pro Gly His He Thr Gly His 
15 10 15 

^ Arg Met Ala Trp 
20 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



% % 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

He Thr Gly His Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro 
15 10 15 

Thr Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



2i (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

W Asn Trp Ser Pro Thr Ala Ala Leu Val Met Ala Gin Leu Leu Arg He 
\| i 5 10 15 

C§ Pro Gin Ala He 
20 

^(2) INFORMATION FOR SEQ ID NO: 69: 

H; (i) SEQUENCE CHARACTERISTICS : 

Bi (A) LENGTH : 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
" (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 

Leu Leu Arg He Pro Gin Ala He Leu Asp Met He Ala Gly Ala His 
15 10 15 

Trp Gly Val Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

Ala Gly Ala His Trp Gly Val Leu Ala Gly He Ala Tyr Phe Ser Met 
1 5 10 15 

Val Gly Asn Met 
20 



(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Val Val Leu Leu Leu Phe Ala Gly Val Asp Ala Glu Thr He Val Ser 
15 10 15 

Gly Gly Gin Ala 
20 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Ser Gly Leu Val Ser Leu Phe Thr Pro Gly Ala Lys Gin Asn He Gin 
15 10 15 

Leu He Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



un/cwv _T 



% % 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Gin Asn He Gin Leu He Asn Thr Asn Gly Gin Trp His He Asn Ser 
15 10 15 

Thr Ala Leu Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 



[ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Leu Asn Cys Asn Glu Ser Leu Asn Thr Gly Trp Trp Leu Ala Gly Leu 

\ 1 5 10 15 

! He Tyr Gin His Lys 

I 20 

% (2) INFORMATION FOR SEQ ID NO: 75: 

I (i) SEQUENCE CHARACTERISTICS : 
l ~ (A) LENGTH: 20 amino acids 

| (B) TYPE: amino acid 

\ (C) STRANDEDNESS : single 

= (D) TOPOLOGY: linear 

. (ii) MOLECULE TYPE; peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Ala Gly Leu He Tyr Gin His Lys Phe Asn Ser Ser Gly Cys Pro Glu 
15 10 15 

Arg Leu Ala Ser 
20 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 




% 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg Pro Leu Thr Asp Phe Asp 
15 10 15 

Gin Gly Trp Gly 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Thr Asp Phe Asp Gin Gly Trp Gly Pro He Ser Tyr Ala Asn Gly Ser 
15 10 15 

Gly Pro Asp Gin 



% i2) INFORMATION FOR SEQ ID NO: 78: 

Q (i) SEQUENCE CHARACTERISTICS: 

yg (A) LENGTH: 20 amino acids 

lZ (B) TYPE: amino acid 

IT* (C) STRANDEDNESS: single 

[% (D) TOPOLOGY: linear 

'"4 (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Ala Asn Gly Ser Gly Pro Asp Gin Arg Pro Tyr Cys Trp His Tyr Pro 
15 10 15 

Pro Lys Pro Cys 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



20 



20 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp His Tyr Pro Pro Lys Pro Cys Gly He Val Pro Ala Lys Ser Val 
1-5 10 15 

Cys Gly Pro Val 
20 

(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

Ala Lys Ser Val Cys Gly Pro Val Tyr Cys Phe Thr Pro Ser Pro Val 
15 10 15 

Val Val Gly Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala Pro Thr 
15 10 15 

Tyr Ser Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Gly Ala Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val 
15 10 15 

Leu Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Gly Asn Trp Phe Gly Cys Thr Trp Met Asn Ser Thr Gly Phe Thr Lys 
O 1 5 10 • 15 

^3 Val Cys Gly Ala 

5^(2) INFORMATION FOR SEQ ID NO: 84: 

iJI (i) SEQUENCE CHARACTERISTICS: 
"4 (A) LENGTH: 20 amino acids 

5 (B) TYPE: amino acid 

r% (C) STRANDEDNESS: single 

% (D) TOPOLOGY: linear 

r! (ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Val Cys lie Gly Gly Ala 
15 10 15 

Gly Asn Asn Thr 
20 

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 



lie Gly Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Arg 
15 10 15 

Lys His Pro 



(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

Thr Asp Cys Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly 
1 5 10 15 

Ser Gly Pro TrD 
20* 

(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

Ser Arg Cys Gly Ser Gly Pro Trp He Thr Pro Arg Cys Leu Val Asp 
15 10 15 

Tyr Pro Tyr Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys Thr He 





1 



5 



15 



Asn Tyr Thr He 
20 



(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

Pro Cys Thr He Asn Tyr Thr He Phe Lys He Arg Met Tyr Val Gly 
15 10 15 

Gly Val Glu His 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Met Tyr Val Gly Gly Val Glu His Arg Leu Glu Ala Ala Cys Asn Trp 
15 10 15 

Thr Pro Gly Glu 



(2) INFORMATION FOR SEQ ID NO: 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

Ala Cys Asn Trp Thr Pro Gly Glu Arg Cys Asp Leu Glu Asp Arg Asp 



20 



^{2} INFORMATION FOR SEQ ID NO: 90: 



20 



1 



15 




% 



Arg Ser Glu Leu 
20 



(2) INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr Thr Thr 
1 5 10 15 

Gin Trp Gin Val 



(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Tyr Gin Val Arg Asn Ser Thr Gly Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: YES 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 
ACGTCCGTAC GTTCGAATTA ATTAATCGA 



29 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE : YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95; 
CCTCCGGACG TGCACTAGCT CCCGTCTGTG GTAGTGGTGG TAGTGATTAT CAATTAATTG 60 



i(2) INFORMATION FOR SEQ ID NO: 96: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 
GTTTAACCAC TGCATGATG 19 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 




GTCCCATCGA GTGCGGCTkC 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGTGACATGG TACATTCCGG ACACTTGGCG CACTTCATAA GCGGA 

P(2) INFORMATION FOR SEQ ID NO: 99: 

S (i) SEQUENCE CHARACTERISTICS: 

m (A) LENGTH: 42 base pairs 

^ (B) TYPE: nucleic acid 

fi (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

u = 

SI (ii) MOLECULE TYPE: DNA (genomic) 

Q (iii) HYPOTHETICAL: NO 

'fl (iii) ANTI-SENSE: NO 



'4 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
^TGCCTCATAC ACAATGGAGC TCTGGGACGA GTCGTTCGTG AC 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 



TACCCAGCAG CGGGAGCTCT GTTGCTCCCG AACGCAGGGC AC 



(2) INFORMATION FOR SEQ ID NO: 101: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

TGTCGTGGTG GGGACGGAGG CCTGCCTAGC TGCGAGCGTG GG 
(2) INFORMATION FOR SEQ ID NO: 102: 

Q (i) SEQUENCE CHARACTERISTICS: 
m (A) LENGTH: 48 base pairs 

."4 (B) TYPE: nucleic acid 

r: (C) STRANDEDNESS: single 

j*| (D) TOPOLOGY: linear 

S§ (ii) MOLECULE TYPE: DNA (genomic) 

ij (iii) HYPOTHETICAL: NO 

U (iii) ANTI-SENSE: NO 



III (xi) SEQUENCE DESCRIPTION : SEQ ID NO: 102: 
rJcGTTATGTGG CCCGGGTAGA TTGAGCACTG GCAGTCCTGC ACCGTCTC 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
i| (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
CAGGGCCGTT CTAGGCCTCC ACTGCATCAT CATATCCCAA GC 




(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
{iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 
CCGGAATGTA CCATGTCACG AACGAC 26 
(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

□ (C) STRANDEDNESS: single 

g (D) TOPOLOGY: linear 

7i (ii) MOLECULE TYPE: DNA (genomic) 

fe ! (iii) HYPOTHETICAL : NO 

(iii) ANTI-SENSE: NO 
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;S (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

HpCTCCATTGT GTATGAGGCA GCGG 24 
£1(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 
GAGCTCCCGC TGCTGGGTAG CGC 23 
(2) INFORMATION FOR SEQ ID NO: 107: 



• 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
CCTCCGTCCC CACCACGACA ATACG 25 
(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
CTACCCGGGC CACATAACGG GTCACCG 27 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
GGAGGCCTAC AACGGCCCTG GTGG 24 
(2) INFORMATION FOR SEQ ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
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*(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
TTCTATCGAT TAAATAGAAT TC 22 
(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
B (iii) HYPOTHETICAL: NO 
5 (iii) ANTI-SENSE: NO 

H (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

, GCCATACGCT CACAGCCGAT CCC 23 
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